We will describe a project's architecture that aims to have a design that let scale functionalities without too much accidental complexity. The architecture picks ideas from DDD and CQRS.
This document provides an overview of Groovy and Grails. Groovy is a dynamic language for the Java Virtual Machine that focuses on programmer productivity. It integrates with Java libraries and compiles to Java bytecode. Grails is a web framework inspired by Ruby on Rails that uses conventions over configurations. It uses Groovy and integrates technologies like Hibernate, Spring, and more to provide features like object-relational mapping, scaffolding, and plugins.
This document provides tools and best practices for creating coastal web maps. It discusses options for data storage, including hosting locally or in the cloud. Common data formats like Shapefiles, GeoJSON and GeoTIFF are recommended. It also covers web mapping services like WMS, WFS and WMTS based on OGC standards. Popular client-side development options are presented, such as APIs from Google Maps and ArcGIS, and open-source libraries like OpenLayers and Leaflet. Resources for further information are provided.
Couchbase Lite is a NoSQL mobile database that uses a document data model with key-value pairs and handles data one document at a time. It supports push and pull replication for syncing documents between devices and servers, including both continuous and one-shot replication with options for persistent or non-persistent settings. The document provides details on Couchbase Lite's data structures, basic operations, replication features, and includes links to related resources and a demo app that uses Cloudant as the backend data layer.
This document discusses serverless computing using AWS Lambda. It provides an overview of Lambda's features like zero administration, auto-scaling, and pay per use model. It also discusses how API Gateway can be used to route requests to Lambda functions. The document compares performance and cost of Lambda to other platforms like Elastic Beanstalk. It provides examples of serverless application design where Lambda functions are triggered by events from databases or APIs. Finally, it briefly describes an open source project to implement Lambda-like functionality on OpenStack.
This document introduces Ballerina, an event-driven programming language for networked applications. It describes Ballerina as having textual and graphical syntaxes, being strongly typed, and being designed for modern development practices like microservices. Key features highlighted include deep integration with protocols like HTTP, JSON, and SQL, built-in support for parallelism, safety and resiliency features, and being open source.
Node.js server side render in the Age of APIs - Full Stack Toronto 2017Ruy Adorno
With the increasing popularity of Universal JavaScript and microservices, more and more front-end developers are taking ownership of their server-side render using Node.js.
In this talk we'll see how leveraging APIs in both server and client side is giving birth to new libraries and frameworks and what is the role of a front end developer in this post-full stack world.
Demo: https://github.com/ruyadorno/react-ssr-fsto-2017-demo
[WSO2Con EU 2017] Ballerina: Exploring Data IntegrationWSO2
Ballerina provides tools for data integration including connectors for popular data sources, data types for representation of different formats, and support for transactions. It allows for graphical data modeling, scheduled data jobs, and rapid development of REST APIs and data services. Ballerina's built-in connectors, types, and features make it suitable for tasks like ETL, data warehousing, and exposing data through services.
The document discusses how different protocols like REST, Kafka, GraphQL, gRPC, and mySQL can be made protocol-agnostic. It defines common attributes across protocols like scope, operation, sending and receiving data formats, asynchronous/streaming behavior, and connection and authentication settings. Making protocols protocol-agnostic provides benefits like a universal specification for documentation, collaboration between teams using different architectures, and a consistent user experience.
This document provides an overview of Groovy and Grails. Groovy is a dynamic language for the Java Virtual Machine that focuses on programmer productivity. It integrates with Java libraries and compiles to Java bytecode. Grails is a web framework inspired by Ruby on Rails that uses conventions over configurations. It uses Groovy and integrates technologies like Hibernate, Spring, and more to provide features like object-relational mapping, scaffolding, and plugins.
This document provides tools and best practices for creating coastal web maps. It discusses options for data storage, including hosting locally or in the cloud. Common data formats like Shapefiles, GeoJSON and GeoTIFF are recommended. It also covers web mapping services like WMS, WFS and WMTS based on OGC standards. Popular client-side development options are presented, such as APIs from Google Maps and ArcGIS, and open-source libraries like OpenLayers and Leaflet. Resources for further information are provided.
Couchbase Lite is a NoSQL mobile database that uses a document data model with key-value pairs and handles data one document at a time. It supports push and pull replication for syncing documents between devices and servers, including both continuous and one-shot replication with options for persistent or non-persistent settings. The document provides details on Couchbase Lite's data structures, basic operations, replication features, and includes links to related resources and a demo app that uses Cloudant as the backend data layer.
This document discusses serverless computing using AWS Lambda. It provides an overview of Lambda's features like zero administration, auto-scaling, and pay per use model. It also discusses how API Gateway can be used to route requests to Lambda functions. The document compares performance and cost of Lambda to other platforms like Elastic Beanstalk. It provides examples of serverless application design where Lambda functions are triggered by events from databases or APIs. Finally, it briefly describes an open source project to implement Lambda-like functionality on OpenStack.
This document introduces Ballerina, an event-driven programming language for networked applications. It describes Ballerina as having textual and graphical syntaxes, being strongly typed, and being designed for modern development practices like microservices. Key features highlighted include deep integration with protocols like HTTP, JSON, and SQL, built-in support for parallelism, safety and resiliency features, and being open source.
Node.js server side render in the Age of APIs - Full Stack Toronto 2017Ruy Adorno
With the increasing popularity of Universal JavaScript and microservices, more and more front-end developers are taking ownership of their server-side render using Node.js.
In this talk we'll see how leveraging APIs in both server and client side is giving birth to new libraries and frameworks and what is the role of a front end developer in this post-full stack world.
Demo: https://github.com/ruyadorno/react-ssr-fsto-2017-demo
[WSO2Con EU 2017] Ballerina: Exploring Data IntegrationWSO2
Ballerina provides tools for data integration including connectors for popular data sources, data types for representation of different formats, and support for transactions. It allows for graphical data modeling, scheduled data jobs, and rapid development of REST APIs and data services. Ballerina's built-in connectors, types, and features make it suitable for tasks like ETL, data warehousing, and exposing data through services.
The document discusses how different protocols like REST, Kafka, GraphQL, gRPC, and mySQL can be made protocol-agnostic. It defines common attributes across protocols like scope, operation, sending and receiving data formats, asynchronous/streaming behavior, and connection and authentication settings. Making protocols protocol-agnostic provides benefits like a universal specification for documentation, collaboration between teams using different architectures, and a consistent user experience.
Flink Forward San Francisco 2018: Ken Krugler - "Building a scalable focused ...Flink Forward
Is it possible to build an efficient, focused web crawler using Flink? That was the question that led to the creation of the flink-crawler open source project. In this talk I’ll discuss how we use Flink’s support for AsyncFunctions and iterations to create a scalable web crawler that continuously and efficiently performs a focused web crawl with no additional infrastructure. I’ll also discuss some of the testing and debugging challenges encountered when using features such as AsyncFunctions and iterations.
This document provides an introduction to RESTful web services. It defines REST and its architectural principles including addressing resources with URIs, using a uniform and constrained interface, focusing on representations of resources, stateless communication, and hypermedia as the engine of application state. It discusses implementing RESTful services with Java using JAX-RS annotations and frameworks like Jersey and CXF. It also covers testing REST APIs, content negotiation, error handling, versioning, and security considerations.
How we place Ruby Hanami from a PoC into a Production ready application at BankFacil.
This talk is Part of First Hanami SP Meetup in Sao Paulo/Brazil
Follow: #hanamirb_sp / #hanamirb
Cataloging with OCLC WMS, February 2014Jeff Siemon
Pros and cons of three workflows for catalogers using WMS, as of February 2014. Using Connection Client, Connection Browser, or WMS Record Manager, and various combinations
This XML Prague 2015 Pre-conference presentations shows practical usage of linked data sources. These sources can help to: enrich content with entities, add link to external data sources, use the enriched content in question answering, machine translation or other scenarios. The aim is to show the practical application of linked data sources in XML tooling. The presentation is an update and provides outcomes of the related session held at XML Prague 2014.
The document discusses the Rails asset pipeline and Sprockets gem. It summarizes that the asset pipeline packages and minifies JavaScript and CSS assets, manages dependencies between assets, and provides a preprocessor pipeline. It also describes Sprockets directives like //= require that search for and include assets. Gems like turbo-sprockets-rails3 and quiet_assets can improve the asset pipeline by speeding up asset precompilation and hiding asset requests in logs.
This document discusses bringing internet data content into GIS applications. It describes how GIS applications can run on desktops, web browsers, and mobile devices. It also discusses different approaches for accessing internet data, including hyperlinks, feeds like GeoRSS, and services. It provides examples of different types of services like map, feature, geoprocessing, and specialized services and how they can be accessed and used.
[WSO2Con EU 2017] File Processing and Websockets with BallerinaWSO2
File processing is used in almost every enterprise IT system at different scales and across different domains. It spans across domains such as processing log files, audit files, data files, and binary files. Ballerina is designed for integration and it comes with a set of core features to implement different file handling use cases. This slide deck discusses file processing and Websockets with Ballerina.
This presentation gives an overview of the Apache Gobblin project. It explains Apache Gobblin in terms of it's architecture, data sources/sinks and it's work unit processing.
Links for further information and connecting
http://www.amazon.com/Michael-Frampton/e/B00NIQDOOM/
https://nz.linkedin.com/pub/mike-frampton/20/630/385
https://open-source-systems.blogspot.com/
CouchDB and PouchDB are document databases that allow for offline data access and synchronization. CouchDB uses replication to enable redundant data storage and load balancing across multiple database nodes. It supports eventual consistency, validation of documents with JavaScript, and incremental replication with automatic conflict detection and resolution. PouchDB is a JavaScript library that synchronizes data with CouchDB and provides an abstraction layer, allowing applications to work with data offline or online in a CouchDB-compatible way.
This document discusses stream processing using Kafka and Samza. It notes that Samza provides low-latency, high-throughput stream processing and tightly integrates with Kafka. Samza handles stream processing, local state storage, and has a simple API. It also notes that at LinkedIn, Kafka is used extensively to handle hundreds of brokers and terabytes of messaging data daily.
HTML 5 Presentation provides an overview of HTML5, including what it is, its new semantic elements for page layout and markup, CSS3 features, support for multimedia and graphics, device access capabilities, performance improvements, offline storage and connectivity features. It also discusses HTML5 pros and cons, and provides useful resources for further information.
This document provides an introduction and overview of using Power BI for reporting in SharePoint Online. It discusses Power BI capabilities for connecting to data, designing dashboards and reports, and publishing for consumption. Key takeaways include how to connect Power BI to SharePoint and other data sources via REST APIs, transform JSON responses into tables for analysis, design interactive visualizations and dashboards, and publish reports to the Power BI service for viewing.
The document discusses single page applications (SPAs) and the AngularJS framework. It defines SPAs as web apps that load on a single page to provide a more fluid user experience. It then explains key characteristics of SPAs and benefits like improved performance. The bulk of the document focuses on introducing concepts in AngularJS like controllers, scopes, templates, directives, filters, services and routing. It provides code examples to demonstrate basic usage and building a todo list app with AngularJS.
Kafka meetup seattle 2019 mirus reliable, high performance replication for ap...Nitin Kumar
Mirus is a tool developed by Salesforce to replicate data between Apache Kafka clusters at scale. It is based on Kafka Connect and provides dynamic configuration, monitoring of topics and partitions, and improved resilience over the default Mirror Maker tool. Mirus handles reliable replication of data between multiple global data centers with minimal latency and data duplication.
The document proposes a Semantic DESCription as a Service (SemDESCaaS) concept to enable semantic annotations for resources independent of their type. It extends the existing DESCaaS concept to generate semantic descriptions using a Resource Model Translator. SemDESCaaS implementations would be Web services that provide interlinked semantic descriptions in ontologies and WSDL formats for any resource via URLs following a pattern. The concept is conceptualized and future work involves prototyping it and adapting it to additional use cases.
Using Azure Mobile Services in Windows Store XAML ApplicationsSergey Barskiy
Talk at DevLink 2013
This session will cover the basics of creating a mobile service, exposing the data through the service and consuming this data in a Windows Store application. We will cover CRUD operations and push notifications as well as securing of the services. Learning objectives 1. Create a mobile service that exposes data to the applications 2. Securing the data 3. Using services for push notifications.
Google App Engine is a Platform as a Service (PaaS) cloud computing platform that allows developers to build and host web applications in Google's data centers. It provides a scalable and reliable environment for developing applications using popular languages like Java, Python, PHP, and Go. App Engine handles tasks like provisioning servers and managing traffic so developers can focus on their code. It also includes services for storage, mail delivery, caching, and accessing web resources. App Engine is well-suited for applications with unpredictable traffic spikes or those where developers don't want to manage their own servers.
This document provides an overview of the SAP ABAP Online Training course offered by MonsterCourses.com. The 50-hour course covers topics such as SAP architecture, ASAP methodology, ABAP fundamentals, database tables, forms, reports, user exits, and more. It aims to provide skills in various ABAP concepts as well as functional exposure through real-time scenarios in areas like materials management and supplier relationship management. The course is available online through MonsterCourses.com.
Deploying Immutable infrastructures with RabbitMQ and SolrJordi Llonch
This document discusses deploying immutable infrastructures for RabbitMQ and Solr clusters. It describes how to deploy a new RabbitMQ cluster using federated queues to migrate services from the old to new cluster with zero downtime. For Solr, it explains how to deploy a new cluster and reindex data from the old cluster using double near real-time indexing before switching search traffic over. Maintaining both clusters allows for A/B testing, performance testing, and functional testing of new configurations without impacting real users.
The document introduces Erlang/OTP as a solution for building scalable and fault-tolerant systems. It discusses how Erlang/OTP uses processes, message passing, and supervision to enable high concurrency and high availability. It also provides an overview of OTP's basic architecture patterns like supervisors, servers, and finite state machines that improve code design and fault handling.
Flink Forward San Francisco 2018: Ken Krugler - "Building a scalable focused ...Flink Forward
Is it possible to build an efficient, focused web crawler using Flink? That was the question that led to the creation of the flink-crawler open source project. In this talk I’ll discuss how we use Flink’s support for AsyncFunctions and iterations to create a scalable web crawler that continuously and efficiently performs a focused web crawl with no additional infrastructure. I’ll also discuss some of the testing and debugging challenges encountered when using features such as AsyncFunctions and iterations.
This document provides an introduction to RESTful web services. It defines REST and its architectural principles including addressing resources with URIs, using a uniform and constrained interface, focusing on representations of resources, stateless communication, and hypermedia as the engine of application state. It discusses implementing RESTful services with Java using JAX-RS annotations and frameworks like Jersey and CXF. It also covers testing REST APIs, content negotiation, error handling, versioning, and security considerations.
How we place Ruby Hanami from a PoC into a Production ready application at BankFacil.
This talk is Part of First Hanami SP Meetup in Sao Paulo/Brazil
Follow: #hanamirb_sp / #hanamirb
Cataloging with OCLC WMS, February 2014Jeff Siemon
Pros and cons of three workflows for catalogers using WMS, as of February 2014. Using Connection Client, Connection Browser, or WMS Record Manager, and various combinations
This XML Prague 2015 Pre-conference presentations shows practical usage of linked data sources. These sources can help to: enrich content with entities, add link to external data sources, use the enriched content in question answering, machine translation or other scenarios. The aim is to show the practical application of linked data sources in XML tooling. The presentation is an update and provides outcomes of the related session held at XML Prague 2014.
The document discusses the Rails asset pipeline and Sprockets gem. It summarizes that the asset pipeline packages and minifies JavaScript and CSS assets, manages dependencies between assets, and provides a preprocessor pipeline. It also describes Sprockets directives like //= require that search for and include assets. Gems like turbo-sprockets-rails3 and quiet_assets can improve the asset pipeline by speeding up asset precompilation and hiding asset requests in logs.
This document discusses bringing internet data content into GIS applications. It describes how GIS applications can run on desktops, web browsers, and mobile devices. It also discusses different approaches for accessing internet data, including hyperlinks, feeds like GeoRSS, and services. It provides examples of different types of services like map, feature, geoprocessing, and specialized services and how they can be accessed and used.
[WSO2Con EU 2017] File Processing and Websockets with BallerinaWSO2
File processing is used in almost every enterprise IT system at different scales and across different domains. It spans across domains such as processing log files, audit files, data files, and binary files. Ballerina is designed for integration and it comes with a set of core features to implement different file handling use cases. This slide deck discusses file processing and Websockets with Ballerina.
This presentation gives an overview of the Apache Gobblin project. It explains Apache Gobblin in terms of it's architecture, data sources/sinks and it's work unit processing.
Links for further information and connecting
http://www.amazon.com/Michael-Frampton/e/B00NIQDOOM/
https://nz.linkedin.com/pub/mike-frampton/20/630/385
https://open-source-systems.blogspot.com/
CouchDB and PouchDB are document databases that allow for offline data access and synchronization. CouchDB uses replication to enable redundant data storage and load balancing across multiple database nodes. It supports eventual consistency, validation of documents with JavaScript, and incremental replication with automatic conflict detection and resolution. PouchDB is a JavaScript library that synchronizes data with CouchDB and provides an abstraction layer, allowing applications to work with data offline or online in a CouchDB-compatible way.
This document discusses stream processing using Kafka and Samza. It notes that Samza provides low-latency, high-throughput stream processing and tightly integrates with Kafka. Samza handles stream processing, local state storage, and has a simple API. It also notes that at LinkedIn, Kafka is used extensively to handle hundreds of brokers and terabytes of messaging data daily.
HTML 5 Presentation provides an overview of HTML5, including what it is, its new semantic elements for page layout and markup, CSS3 features, support for multimedia and graphics, device access capabilities, performance improvements, offline storage and connectivity features. It also discusses HTML5 pros and cons, and provides useful resources for further information.
This document provides an introduction and overview of using Power BI for reporting in SharePoint Online. It discusses Power BI capabilities for connecting to data, designing dashboards and reports, and publishing for consumption. Key takeaways include how to connect Power BI to SharePoint and other data sources via REST APIs, transform JSON responses into tables for analysis, design interactive visualizations and dashboards, and publish reports to the Power BI service for viewing.
The document discusses single page applications (SPAs) and the AngularJS framework. It defines SPAs as web apps that load on a single page to provide a more fluid user experience. It then explains key characteristics of SPAs and benefits like improved performance. The bulk of the document focuses on introducing concepts in AngularJS like controllers, scopes, templates, directives, filters, services and routing. It provides code examples to demonstrate basic usage and building a todo list app with AngularJS.
Kafka meetup seattle 2019 mirus reliable, high performance replication for ap...Nitin Kumar
Mirus is a tool developed by Salesforce to replicate data between Apache Kafka clusters at scale. It is based on Kafka Connect and provides dynamic configuration, monitoring of topics and partitions, and improved resilience over the default Mirror Maker tool. Mirus handles reliable replication of data between multiple global data centers with minimal latency and data duplication.
The document proposes a Semantic DESCription as a Service (SemDESCaaS) concept to enable semantic annotations for resources independent of their type. It extends the existing DESCaaS concept to generate semantic descriptions using a Resource Model Translator. SemDESCaaS implementations would be Web services that provide interlinked semantic descriptions in ontologies and WSDL formats for any resource via URLs following a pattern. The concept is conceptualized and future work involves prototyping it and adapting it to additional use cases.
Using Azure Mobile Services in Windows Store XAML ApplicationsSergey Barskiy
Talk at DevLink 2013
This session will cover the basics of creating a mobile service, exposing the data through the service and consuming this data in a Windows Store application. We will cover CRUD operations and push notifications as well as securing of the services. Learning objectives 1. Create a mobile service that exposes data to the applications 2. Securing the data 3. Using services for push notifications.
Google App Engine is a Platform as a Service (PaaS) cloud computing platform that allows developers to build and host web applications in Google's data centers. It provides a scalable and reliable environment for developing applications using popular languages like Java, Python, PHP, and Go. App Engine handles tasks like provisioning servers and managing traffic so developers can focus on their code. It also includes services for storage, mail delivery, caching, and accessing web resources. App Engine is well-suited for applications with unpredictable traffic spikes or those where developers don't want to manage their own servers.
This document provides an overview of the SAP ABAP Online Training course offered by MonsterCourses.com. The 50-hour course covers topics such as SAP architecture, ASAP methodology, ABAP fundamentals, database tables, forms, reports, user exits, and more. It aims to provide skills in various ABAP concepts as well as functional exposure through real-time scenarios in areas like materials management and supplier relationship management. The course is available online through MonsterCourses.com.
Deploying Immutable infrastructures with RabbitMQ and SolrJordi Llonch
This document discusses deploying immutable infrastructures for RabbitMQ and Solr clusters. It describes how to deploy a new RabbitMQ cluster using federated queues to migrate services from the old to new cluster with zero downtime. For Solr, it explains how to deploy a new cluster and reindex data from the old cluster using double near real-time indexing before switching search traffic over. Maintaining both clusters allows for A/B testing, performance testing, and functional testing of new configurations without impacting real users.
The document introduces Erlang/OTP as a solution for building scalable and fault-tolerant systems. It discusses how Erlang/OTP uses processes, message passing, and supervision to enable high concurrency and high availability. It also provides an overview of OTP's basic architecture patterns like supervisors, servers, and finite state machines that improve code design and fault handling.
El agua y los vegetales: El plancton y su desarrollo en el agua. Plantas acuáticas.
El agua y los animales acuáticos: Animales de agua dulce. Animales de aguas saladas.
Un nuevo Parque Extremo se inauguró en Loja que cuenta con una pista de 2 km para quads y UTV. La pista fue construida para brindar seguridad y acceso a primeros auxilios. El parque se construyó en una antigua hacienda para fomentar el deporte y turismo en la ciudad.
Dokumen tersebut membahas perubahan struktur ekonomi Indonesia dari yang bersifat agraris menjadi industrialisasi, meskipun belum sepenuhnya. Terjadi pergeseran kontribusi sektor pertanian ke sektor industri, namun penyerapan tenaga kerja masih didominasi sektor pertanian. Struktur ekonomi juga bergeser dari pedesaan ke perkotaan seiring pembangunan.
This document provides information about best practices for governance, the TeamMates mentoring program, and board committee roles. It discusses TeamMates' mission, vision, values and strategic initiatives to positively impact youth through mentoring. The six foundations of an effective team are outlined as trust, commitment, accountability, ownership, goals and communication. Guidelines are presented for board committee structure and operation, including defining goals and reviewing activities. Ethical practices for staffing, marketing, consumer satisfaction and privacy are also summarized.
This document provides guidance for celebrating a 25th anniversary. It outlines when and where new logos can be used for the anniversary, which is from now until July 2017. It directs the reader to a toolkit on the Playbook for logo examples, templates, and ideas on how mentors, mentees, and the public can be celebrated, such as through gifts, events, and media outreach. The document concludes by asking the reader to share one celebration idea for their chapter and provides a contact for any customization needs.
This document provides guidance to local chapters on establishing scholarship programs for mentees. It discusses TeamMates Central Office scholarships awarded from 2006-present. Local chapters are encouraged to create scholarship handbooks, which provide details on scholarship committees, eligibility guidelines, applications, scoring rubrics, and payment procedures. Chapters can fund scholarships through annual operating budgets, special fundraising campaigns, and endowments managed by the TeamMates Foundation. Donor stewardship tactics are also presented to recognize scholarship donors and sustain funding.
This document provides an overview of the organizational structure, policies, procedures, and resources for a school-based mentoring program called TeamMates. It covers the program's elements and standards for recruiting mentors and mentees, screening applicants, providing training, making matches, offering ongoing support, and facilitating closures. The document also addresses fundraising, data collection, and special events.
Introduction to DI Engineering Explorer for Financial ServicesDrillinginfo
DI Engineering Explorer is a proprietary tool that allows customers to visualize completion and production data from over 130,000 wells across 14 states. It saves customers over $800,000 and a year of man-hours by providing well-level analysis and insights. The tool finds correlations between completion techniques and production metrics and identifies the materials used in the most productive wells.
The Secrets of Building Realtime Big Data Systemsnathanmarz
The architectural principles behind building systems that scale to vast amounts of data and operate on that data in realtime.
Presented at POSSCON '11.
ClojuTRE2015: Kekkonen - making your Clojure web APIs more awesomeMetosin Oy
Thanks to REST and Swagger, we can build beautiful apis to feed both our browser front-ends and external applications. But, wrapping your Clojure code into resources mostly for your ClojureScript front-end doesn't feel right? Just use RPC? Meet in the middle?
Kekkonen is a small library for managing your (web) apis as commands and queries. No magic, data-driven, un-restful and non-rpc. It's goals are to be small, explicit, extendable and to help enforce your business rules both on the server side and on the ClojureScript frontend. Besides Swagger, it provides run-time context-aware apidocs for Clojure(Script).
This document provides an overview and introduction to Adobe Experience Manager (AEM), including its key components and architecture. It discusses AEM's content services, use of Sling and JCR technologies, and how content is stored and replicated using the Oak storage layer with implementations like TarMK and MongoMK. The document also provides information on AEM's RESTful principles, OSGi framework, and common deployment topologies.
Cask Webinar
Date: 08/10/2016
Link to video recording: https://www.youtube.com/watch?v=XUkANr9iag0
In this webinar, Nitin Motgi, CTO of Cask, walks through the new capabilities of CDAP 3.5 and explains how your organization can benefit.
Some of the highlights include:
- Enterprise-grade security - Authentication, authorization, secure keystore for storing configurations. Plus integration with Apache Sentry and Apache Ranger.
- Preview mode - Ability to preview and debug data pipelines before deploying them.
- Joins in Cask Hydrator - Capabilities to join multiple data sources in data pipelines
- Real-time pipelines with Spark Streaming - Drag & drop real-time pipelines using Spark Streaming.
- Data usage analytics - Ability to report application usage of data sets.
- And much more!
Google AutoML, AWS SageMaker and other ML tools automate some but not all steps in machine learning workflows. Learn about problem formulation, data engineering, monitoring, and fairness assessment.
A Collaborative Data Science Development WorkflowDatabricks
Collaborative data science workflows have several moving parts, and many organizations struggle with developing an efficient and scalable process. Our solution consists of data scientists individually building and testing Kedro pipelines and measuring performance using MLflow tracking. Once a strong solution is created, the candidate pipeline is trained on cloud-agnostic, GPU-enabled containers. If this pipeline is production worthy, the resulting model is served to a production application through MLflow.
Service-Oriented Design and Implement with Rails3Wen-Tien Chang
The service implements a RESTful Users web service with Rails 3. It customizes Rails to remove unnecessary components and optimize for a lightweight REST service. The service follows best practices for API design including using JSON format, placing JSON conversion in the controller, and returning appropriate HTTP status codes. Requests are designed to be stateless and atomic. Errors are returned in a standardized JSON format.
Databricks Meetup @ Los Angeles Apache Spark User GroupPaco Nathan
This document summarizes a presentation on Apache Spark and Spark Streaming. It provides an overview of Spark, describing it as an in-memory cluster computing framework. It then discusses Spark Streaming, explaining that it runs streaming computations as small batch jobs to provide low latency processing. Several use cases for Spark Streaming are presented, including from companies like Stratio, Pearson, Ooyala, and Sharethrough. The presentation concludes with a demonstration of Python Spark Streaming code.
"Introduction to Sparkling Water" — Jakub Hava, Senior Software Engineer, at ...Provectus
"Introduction to Sparkling Water" — Jakub Hava, Senior Software Engineer at H2O.ai.
Jakub Hava enjoys dealing with problems and learning new programming languages. At H2O.ai, Kuba works on Sparkling Water project and its integration with the rest of the H2O.ai ecosystem.
The speech includes a live demo showing how to create a Sparkling Water pipeline with H2O.ai's XGBoost model — no terminal needed, all we need is Jupyter!
Running Airflow Workflows as ETL Processes on Hadoopclairvoyantllc
While working with Hadoop, you'll eventually encounter the need to schedule and run workflows to perform various operations like ingesting data or performing ETL. There are a number of tools available to assist you with this type of requirement and one such tool that we at Clairvoyant have been looking to use is Apache Airflow. Apache Airflow is an Apache Incubator project that allows you to programmatically create workflows through a python script. This provides a flexible and effective way to design your workflows with little code and setup. In this talk, we will discuss Apache Airflow and how we at Clairvoyant have utilized it for ETL pipelines on Hadoop.
Mike Spicer is the lead architect for the IBM Streams team. In his presentation, Mike provides an overview of the many key new features available in IBM Streams V4.1. Simpler development, simpler management, and Spark integration are a few of the capabilities included in IBM Streams V4.1.
This document outlines a "featherweight stack" for lightweight gCube clients that avoids dependencies on the full gCore stack. It proposes standalone client libraries for discovery, resources, and calling services using just the JDK instead of Axis, Globus, and gCore. Specific topics covered include generating lightweight service stubs, binding resources with JAXB, executing predefined and free-form discovery queries, and providing modular client libraries as an alternative to gCore clients. The goal is to improve client usability, modularity, and independence from obsolete dependencies.
Spark as a Platform to Support Multi-Tenancy and Many Kinds of Data Applicati...Spark Summit
This document summarizes Uber's use of Spark as a data platform to support multi-tenancy and various data applications. Key points include:
- Uber uses Spark on YARN for resource management and isolation between teams/jobs. Parquet is used as the columnar file format for performance and schema support.
- Challenges include sharing infrastructure between many teams with different backgrounds and use cases. Spark provides a common platform.
- An Uber Development Kit (UDK) is used to help users get Spark jobs running quickly on Uber's infrastructure, with templates, defaults, and APIs for common tasks.
This document discusses the large-scale Drupal implementation for the Municipality of Copenhagen. It describes the client's requirements including handling thousands of websites on a multisite platform, scalability, reusability, content sharing between sites, and accessibility. It then explains the technical solutions used such as Aegir for the multisite platform, Gitlab for module development, and various Drupal modules for features like content sharing, editing interface, and performance optimization. Continuous integration, testing, and contributions back to the open source community are also covered.
Apache Arrow: Present and Future @ ScaledML 2020Wes McKinney
This document discusses Apache Arrow, an open source project that provides cross-language data structures and algorithms for efficient data analytics. It summarizes the history and goals of Arrow, provides examples of how it has been adopted, and outlines ongoing development initiatives. Key points include that Arrow aims to accelerate data processing by standardizing columnar data formats and protocols, it has seen widespread adoption with over 50M installs in 2019, and active areas of work include the C++ development platform and Arrow Flight RPC framework.
This document discusses various technologies related to architectures, frameworks, infrastructure, services, data stores, analytics, logging and metrics. It covers Java 8 features like lambda expressions and method references. It also discusses microservices, Spring Boot basics and features, Gradle vs Maven, Swagger, AngularJS, Gulp, Jasmine, Karma, Nginx, CloudFront, Couchbase, Lambda Architecture, logging with Fluentd and Elasticsearch, metrics collection with Collectd and Statsd, and visualization with Graphite and Grafana.
Simplifying Hadoop with RecordService, A Secure and Unified Data Access Path ...Cloudera, Inc.
SFHUG presentation from February 2, 2016. One of the key values of the Hadoop ecosystem is its flexibility. There is a myriad of components that make up this ecosystem, allowing Hadoop to tackle otherwise intractable problems. However, having so many components provides a significant integration, implementation, and usability burden. Features that ought to work in all the components often require sizable per-component effort to ensure correctness across the stack.
Lenni Kuff explores RecordService, a new solution to this problem that provides an API to read data from Hadoop storage managers and return them as canonical records. This eliminates the need for components to support individual file formats, handle security, perform auditing, and implement sophisticated IO scheduling and other common processing that is at the bottom of any computation.
Lenni discusses the architecture of the service and the integration work done for MapReduce and Spark. Many existing applications on those frameworks can take advantage of the service with little to no modification. Lenni demonstrates how this provides fine grain (column level and row level) security, through Sentry integration, and improves performance for existing MapReduce and Spark applications by up to 5×. Lenni concludes by discussing how this architecture can enable significant future improvements to the Hadoop ecosystem.
About the speaker: Lenni Kuff is an engineering manager at Cloudera. Before joining Cloudera, he worked at Microsoft on a number of projects including SQL Server storage engine, SQL Azure, and Hadoop on Azure. Lenni graduated from the University of Wisconsin-Madison with degrees in computer science and computer engineering.
Introducing Apache Kafka and why it is important to Oracle, Java and IT profe...Lucas Jellema
Events are playing an increasingly important role in modern application architecture. They represent fast, streaming data, they fuel the interaction between microservices, they are at the core of CQRS and event sourcing. Apache Kafka has quickly emerged as the de facto standard event platform: open source, cross technology, reliable and extremely scalable and available on any platform, in Docker and from the major cloud platforms- including Oracle Cloud’s Event Hub service. This session explains the what, why and how of Apache Kafka. What role does it play, how is it used and what are challenges and tricks for real life applications. How does it fit in with Oracle Database and Fusion Middleware and with Oracle Public Cloud? In several demos, Kafka is seen at work - in real time streaming event analysis through KSQL, in CQRS and microservices scenarios and with user interfaces updated in real time through events and HTML5 server sent events.
This presentation includes a demonstration of remote database synchronization through Twitter.
Similar to A production project's architecture with clojure (20)
Removing Uninteresting Bytes in Software FuzzingAftab Hussain
Imagine a world where software fuzzing, the process of mutating bytes in test seeds to uncover hidden and erroneous program behaviors, becomes faster and more effective. A lot depends on the initial seeds, which can significantly dictate the trajectory of a fuzzing campaign, particularly in terms of how long it takes to uncover interesting behaviour in your code. We introduce DIAR, a technique designed to speedup fuzzing campaigns by pinpointing and eliminating those uninteresting bytes in the seeds. Picture this: instead of wasting valuable resources on meaningless mutations in large, bloated seeds, DIAR removes the unnecessary bytes, streamlining the entire process.
In this work, we equipped AFL, a popular fuzzer, with DIAR and examined two critical Linux libraries -- Libxml's xmllint, a tool for parsing xml documents, and Binutil's readelf, an essential debugging and security analysis command-line tool used to display detailed information about ELF (Executable and Linkable Format). Our preliminary results show that AFL+DIAR does not only discover new paths more quickly but also achieves higher coverage overall. This work thus showcases how starting with lean and optimized seeds can lead to faster, more comprehensive fuzzing campaigns -- and DIAR helps you find such seeds.
- These are slides of the talk given at IEEE International Conference on Software Testing Verification and Validation Workshop, ICSTW 2022.
Maruthi Prithivirajan, Head of ASEAN & IN Solution Architecture, Neo4j
Get an inside look at the latest Neo4j innovations that enable relationship-driven intelligence at scale. Learn more about the newest cloud integrations and product enhancements that make Neo4j an essential choice for developers building apps with interconnected data and generative AI.
“An Outlook of the Ongoing and Future Relationship between Blockchain Technologies and Process-aware Information Systems.” Invited talk at the joint workshop on Blockchain for Information Systems (BC4IS) and Blockchain for Trusted Data Sharing (B4TDS), co-located with with the 36th International Conference on Advanced Information Systems Engineering (CAiSE), 3 June 2024, Limassol, Cyprus.
Threats to mobile devices are more prevalent and increasing in scope and complexity. Users of mobile devices desire to take full advantage of the features
available on those devices, but many of the features provide convenience and capability but sacrifice security. This best practices guide outlines steps the users can take to better protect personal devices and information.
Full-RAG: A modern architecture for hyper-personalizationZilliz
Mike Del Balso, CEO & Co-Founder at Tecton, presents "Full RAG," a novel approach to AI recommendation systems, aiming to push beyond the limitations of traditional models through a deep integration of contextual insights and real-time data, leveraging the Retrieval-Augmented Generation architecture. This talk will outline Full RAG's potential to significantly enhance personalization, address engineering challenges such as data management and model training, and introduce data enrichment with reranking as a key solution. Attendees will gain crucial insights into the importance of hyperpersonalization in AI, the capabilities of Full RAG for advanced personalization, and strategies for managing complex data integrations for deploying cutting-edge AI solutions.
For the full video of this presentation, please visit: https://www.edge-ai-vision.com/2024/06/building-and-scaling-ai-applications-with-the-nx-ai-manager-a-presentation-from-network-optix/
Robin van Emden, Senior Director of Data Science at Network Optix, presents the “Building and Scaling AI Applications with the Nx AI Manager,” tutorial at the May 2024 Embedded Vision Summit.
In this presentation, van Emden covers the basics of scaling edge AI solutions using the Nx tool kit. He emphasizes the process of developing AI models and deploying them globally. He also showcases the conversion of AI models and the creation of effective edge AI pipelines, with a focus on pre-processing, model conversion, selecting the appropriate inference engine for the target hardware and post-processing.
van Emden shows how Nx can simplify the developer’s life and facilitate a rapid transition from concept to production-ready applications.He provides valuable insights into developing scalable and efficient edge AI solutions, with a strong focus on practical implementation.
UiPath Test Automation using UiPath Test Suite series, part 5DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 5. In this session, we will cover CI/CD with devops.
Topics covered:
CI/CD with in UiPath
End-to-end overview of CI/CD pipeline with Azure devops
Speaker:
Lyndsey Byblow, Test Suite Sales Engineer @ UiPath, Inc.
Securing your Kubernetes cluster_ a step-by-step guide to success !KatiaHIMEUR1
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
Unlocking Productivity: Leveraging the Potential of Copilot in Microsoft 365, a presentation by Christoforos Vlachos, Senior Solutions Manager – Modern Workplace, Uni Systems
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIVladimir Iglovikov, Ph.D.
Presented by Vladimir Iglovikov:
- https://www.linkedin.com/in/iglovikov/
- https://x.com/viglovikov
- https://www.instagram.com/ternaus/
This presentation delves into the journey of Albumentations.ai, a highly successful open-source library for data augmentation.
Created out of a necessity for superior performance in Kaggle competitions, Albumentations has grown to become a widely used tool among data scientists and machine learning practitioners.
This case study covers various aspects, including:
People: The contributors and community that have supported Albumentations.
Metrics: The success indicators such as downloads, daily active users, GitHub stars, and financial contributions.
Challenges: The hurdles in monetizing open-source projects and measuring user engagement.
Development Practices: Best practices for creating, maintaining, and scaling open-source libraries, including code hygiene, CI/CD, and fast iteration.
Community Building: Strategies for making adoption easy, iterating quickly, and fostering a vibrant, engaged community.
Marketing: Both online and offline marketing tactics, focusing on real, impactful interactions and collaborations.
Mental Health: Maintaining balance and not feeling pressured by user demands.
Key insights include the importance of automation, making the adoption process seamless, and leveraging offline interactions for marketing. The presentation also emphasizes the need for continuous small improvements and building a friendly, inclusive community that contributes to the project's growth.
Vladimir Iglovikov brings his extensive experience as a Kaggle Grandmaster, ex-Staff ML Engineer at Lyft, sharing valuable lessons and practical advice for anyone looking to enhance the adoption of their open-source projects.
Explore more about Albumentations and join the community at:
GitHub: https://github.com/albumentations-team/albumentations
Website: https://albumentations.ai/
LinkedIn: https://www.linkedin.com/company/100504475
Twitter: https://x.com/albumentations
Sudheer Mechineni, Head of Application Frameworks, Standard Chartered Bank
Discover how Standard Chartered Bank harnessed the power of Neo4j to transform complex data access challenges into a dynamic, scalable graph database solution. This keynote will cover their journey from initial adoption to deploying a fully automated, enterprise-grade causal cluster, highlighting key strategies for modelling organisational changes and ensuring robust disaster recovery. Learn how these innovations have not only enhanced Standard Chartered Bank’s data infrastructure but also positioned them as pioneers in the banking sector’s adoption of graph technology.
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...Zilliz
Join us to introduce Milvus Lite, a vector database that can run on notebooks and laptops, share the same API with Milvus, and integrate with every popular GenAI framework. This webinar is perfect for developers seeking easy-to-use, well-integrated vector databases for their GenAI apps.
A tale of scale & speed: How the US Navy is enabling software delivery from l...sonjaschweigert1
Rapid and secure feature delivery is a goal across every application team and every branch of the DoD. The Navy’s DevSecOps platform, Party Barge, has achieved:
- Reduction in onboarding time from 5 weeks to 1 day
- Improved developer experience and productivity through actionable findings and reduction of false positives
- Maintenance of superior security standards and inherent policy enforcement with Authorization to Operate (ATO)
Development teams can ship efficiently and ensure applications are cyber ready for Navy Authorizing Officials (AOs). In this webinar, Sigma Defense and Anchore will give attendees a look behind the scenes and demo secure pipeline automation and security artifacts that speed up application ATO and time to production.
We will cover:
- How to remove silos in DevSecOps
- How to build efficient development pipeline roles and component templates
- How to deliver security artifacts that matter for ATO’s (SBOMs, vulnerability reports, and policy evidence)
- How to streamline operations with automated policy checks on container images
4. domain
• contexts
• modules
• application services
• value objects/aggregates
• interfaces for repositories and external services
• buses
• command bus
• oracle
• domain event publisher
5. infrastructure
• http rest api server
• rabbitmq workers
• key/value database
• relational database
• non blocking promises
6. libraries
• com.stuartsierra/component: managing the lifecycle and dependencies of software components which have runtime state
• co.paralleluniverse/pulsar: provides high-performance lightweight threads, Go-like channels, Erlang-like actors, and other
asynchronous programming tools
• prismatic/schema: declarative data description and validation
• funcool/catacumba: web toolkit
• com.taoensso/timbre: logging and profiling
• hikari-cp: high-performance JDBC connection pool
• slingshot: enhanced throw and catch for Clojure
• com.gearswithingears/shrubbery: mocking
• com.novemberain/langohr: RabbitMQ client
• com.taoensso/carmine: Redis client
15. • Stuart Sierra component
• lifecycle and dependencies management of runtime state
• rapid workflow
• component next
• adaptation of Stuart Sierra components map definition
• simplifiers
• bus handlers
• api rest routing
components