So, your business has already opened sourced some of it's code? Great! But now, someone's asking you about giving it to these Apache people? What's up with that, and why isn't just being open source enough?
In this talk, we'll look at several real world examples of where companies have chosen to contribute their existing open source code to the Apache Software Foundation. We'll see the advantages they got from it, the problems they faced along the way, why they did it, and how it helped their business. We'll also look briefly at where it may not be the right fit.
Wondering about how to take your business's open source involvement to the next level, and if contributing to projects at the Apache Software Foundation will deliver RoI, then this is the talk for you!
Alfresco provides many extensions points for building custom solutions, integrating with a multitude of different tools. Learn how Alfresco’s Integration Engineering team has approached building integrations with Jive and Dropbox. See what’s coming around the bend and how you can participate with community lead integration projects.
This document discusses various content integration projects between Alfresco and other systems. It provides an overview of Alfresco integrations with Jive and Dropbox. For the Jive integration, it describes how documents can be uploaded, published, or linked between the two systems while maintaining social features. For the Dropbox integration, it outlines plans to sync content and folders between Alfresco and Dropbox accounts. Other topics covered include implementation details, upcoming features, and a call for community contributions to other integration projects.
Markup languages and warp-speed documentationLois Patterson
The presentation discusses how software development has moved towards more frequent releases through DevOps practices. This requires documentation to also be updated quickly. Markup languages can help by allowing many contributors to collaborate easily on documentation. Specific markup languages mentioned include reStructuredText and Markdown, which can be processed by tools like Sphinx to generate documentation from plain text files. The presentation demonstrates how to use reStructuredText and emphasizes that markup languages, collaborative tools like GitHub, and automation are key to supporting modern rapid software development practices.
I gave this talk on IEEE Day (October 7, 2014). I covered Introduction to Open Source, Various Projects and Products in Open Source, What students can get from Open Source and various different aspects of Open Source during this talk.
Please feel free to download, modify and use the slides for your talks. Lets keep rocking the Free Web ! :)
Lois Patterson: Markup Languages and Warp-Speed DocumentationJack Molisani
The presentation discusses how software development has moved towards more frequent releases through DevOps practices. This requires documentation to also be updated quickly. Markup languages can help by allowing many contributors to collaborate easily on documentation. Specific markup languages mentioned include reStructuredText and Markdown, which can be processed by tools like Sphinx to generate documentation from plain text files. The presentation demonstrates how to use reStructuredText and emphasizes that markup languages, collaborative tools like GitHub, and automation are key to supporting modern rapid software development practices.
This document discusses strategies for creating an open source community as a software publisher. It recommends building community to get feedback, contributors, and evangelists which improves software quality. Key elements include choosing an open source license and development model, and using tools like websites, code repositories, issue trackers, and mailing lists for marketing, development, and governance. Community engagement is important through conferences, workshops, and being responsive to contributors.
This document provides an overview of open source software and recommendations for companies adopting open source. It discusses how open source can accelerate projects and attract talent. It profiles companies like Adobe, Netflix, Oracle, Samsung, and Microsoft that contribute to open source despite not being commonly associated with it. The document outlines how to launch an open source project, including using an open source license, README, contribution guidelines, and code of conduct. It also discusses roles in open source projects and various open source business models. The recommendations encourage companies to publish independent components on GitHub, take releases from GitHub, and create developer websites to engage with the open source community.
This document discusses the challenges of developing packages before and after the introduction of Composer. Before Composer, packages were tightly coupled to specific frameworks and became incompatible as frameworks evolved. This led to duplication of packages across frameworks. Composer addressed this by allowing packages to be decoupled from frameworks via interfaces and drivers. The document encourages developers to design packages that can integrate with multiple frameworks through bridging packages and drivers to strengthen interoperability and productivity.
Alfresco provides many extensions points for building custom solutions, integrating with a multitude of different tools. Learn how Alfresco’s Integration Engineering team has approached building integrations with Jive and Dropbox. See what’s coming around the bend and how you can participate with community lead integration projects.
This document discusses various content integration projects between Alfresco and other systems. It provides an overview of Alfresco integrations with Jive and Dropbox. For the Jive integration, it describes how documents can be uploaded, published, or linked between the two systems while maintaining social features. For the Dropbox integration, it outlines plans to sync content and folders between Alfresco and Dropbox accounts. Other topics covered include implementation details, upcoming features, and a call for community contributions to other integration projects.
Markup languages and warp-speed documentationLois Patterson
The presentation discusses how software development has moved towards more frequent releases through DevOps practices. This requires documentation to also be updated quickly. Markup languages can help by allowing many contributors to collaborate easily on documentation. Specific markup languages mentioned include reStructuredText and Markdown, which can be processed by tools like Sphinx to generate documentation from plain text files. The presentation demonstrates how to use reStructuredText and emphasizes that markup languages, collaborative tools like GitHub, and automation are key to supporting modern rapid software development practices.
I gave this talk on IEEE Day (October 7, 2014). I covered Introduction to Open Source, Various Projects and Products in Open Source, What students can get from Open Source and various different aspects of Open Source during this talk.
Please feel free to download, modify and use the slides for your talks. Lets keep rocking the Free Web ! :)
Lois Patterson: Markup Languages and Warp-Speed DocumentationJack Molisani
The presentation discusses how software development has moved towards more frequent releases through DevOps practices. This requires documentation to also be updated quickly. Markup languages can help by allowing many contributors to collaborate easily on documentation. Specific markup languages mentioned include reStructuredText and Markdown, which can be processed by tools like Sphinx to generate documentation from plain text files. The presentation demonstrates how to use reStructuredText and emphasizes that markup languages, collaborative tools like GitHub, and automation are key to supporting modern rapid software development practices.
This document discusses strategies for creating an open source community as a software publisher. It recommends building community to get feedback, contributors, and evangelists which improves software quality. Key elements include choosing an open source license and development model, and using tools like websites, code repositories, issue trackers, and mailing lists for marketing, development, and governance. Community engagement is important through conferences, workshops, and being responsive to contributors.
This document provides an overview of open source software and recommendations for companies adopting open source. It discusses how open source can accelerate projects and attract talent. It profiles companies like Adobe, Netflix, Oracle, Samsung, and Microsoft that contribute to open source despite not being commonly associated with it. The document outlines how to launch an open source project, including using an open source license, README, contribution guidelines, and code of conduct. It also discusses roles in open source projects and various open source business models. The recommendations encourage companies to publish independent components on GitHub, take releases from GitHub, and create developer websites to engage with the open source community.
This document discusses the challenges of developing packages before and after the introduction of Composer. Before Composer, packages were tightly coupled to specific frameworks and became incompatible as frameworks evolved. This led to duplication of packages across frameworks. Composer addressed this by allowing packages to be decoupled from frameworks via interfaces and drivers. The document encourages developers to design packages that can integrate with multiple frameworks through bridging packages and drivers to strengthen interoperability and productivity.
This document discusses how taxonomy and content delivery can improve the customer experience across multiple touchpoints. It provides examples of how a shared taxonomy can enable faceted search and content personalization in a documentation portal, knowledge base, and customer community. The document also outlines use cases for intelligent context-sensitive help, content entitlements for specific user roles, and delivering personalized technical documentation to IoT devices. Best practices highlighted include implementing an agile taxonomy process, analyzing customer journeys, and using a dynamic publishing system to support all customer touchpoints.
War stories from building the Global Patent Search Network, and why Data folks need to think more about UX and Discovery, and UX folks need to think more about Data.
This document provides a case study on a project created using open source technology. It discusses analyzing project goals and resources, evaluating open source options based on total cost of ownership, implementing a solution using LAMP stack, and lessons learned. The project was developed using Linux, Apache, MySQL, and PHP based on the needs of a low budget, ability to invest in internal skills, and reduce dependency on external trends. Key steps included preparing the Linux server, using version control and local testing, and engaging the open source community for support.
Stop Building It From Scratch: Creating Reusable eLearning ComponentsChad Udell
Is your organization constantly going through the same development path to produce your e-Learning... rehashing the same code, digging through line after line of spaghetti, and not seeing any real reuse benefits? If you want to leverage your intellectual property, and put your toolset to work for you, come ready to take note of practical tools, tips, and techniques you can employ immediately to enjoy productivity benefits.
In this session, you’ll learn secrets that Rapid Development gurus use. If you are tired of simple tools that speed up development, but really tie your hands when it comes to true customization, you need to learn more about application programming interfaces (APIs). The Web 2.0 world has largely been shaped by the emergence of Web services and the dominance of XML, yet so few e Learning development programs allow you to tap into those powerful tools. You’ll leave this session with ideas on how to build your own APIs, or use preexisting ones, right away.
In this session, you will learn:
* The benefits of building content in a reusable format
* Practical examples of reusable e-Learning concepts
* How to apply object-oriented development techniques to e-Learning
* What APIs are, and how you can use them to speed up development
* Techniques for designing your own e-Learning APIs
* Designing data schemas for flexibility
Audience:
Advanced designers and developers with basic programming skills in ActionScript, and who have edited XML. Deeper understanding of XML concepts and terminology will assist in greater understanding of this topic.
Cassandra Day NY 2014: Message Architectures in Distributed Systems at Simple...DataStax Academy
Eric will be presenting on SimpleReach's use of message architectures and why they an important part of a distributed system stack. They are often overlooked because the prevailing sentiment is that the storage and processing engines are the most important aspects of the system. Without the highways, the data won’t be able to get to its destination.
Summit 16: The Practice of Building OPNFV User Group in ChinaOPNFV
The Practice of Building OPNFV User Group in China(Liang(Kubi) Gao) - when we talking about community, community is about relationship. We developing community by developing relationship. So building user group is an important thing for OPNFV Community. In this presentation, Liang Gao will review recent OPNFV meetups in China. and introduce their practice to build user group in China, including(meetup.com, WeChat group, WeChat Official Accounts and so on) ,and will show the plan of building user group in China.
Navigating the Incubator at the Apache Software FoundationBrett Porter
The document discusses navigating the Apache Incubator process for open source projects. It explains that the Incubator helps projects join the Apache Foundation by ensuring they meet legal standards and the collaborative development model. Projects go through an incubation phase where they build community, follow procedures, and release versions before attempting to graduate. Not all projects will graduate but the Incubator addresses important issues for projects seeking to join Apache.
Core Principles of CI was a presentation that covered:
1) The definition of continuous integration as automated builds that run many times a day.
2) The key aspects of CI including source code compilation, database integration, testing, inspection, deployment, feedback, and producing deployable software.
3) The benefits of CI like reduced human error, rapid feedback cycles, increased project visibility and confidence, and facilitating communication.
Keeping the Content Train on the Tracks (And on Topic)Kristen Eberlein
How do you keep your content moving forward, while you are weighed down with baggage from the past? How do you change course when the tracks ahead seem to have washed away? Come hear the editors of the DITA 1.3 specification talk about creative strategies they put into place to streamline the process of developing a version of the specification, improve the quality and usability of the content, and get to their destination on schedule.
[Annotated] QConSF 2018: Airbnb's Great Migration - From Monolith to Service-...Jessica Tai
Airbnb migrated from a monolithic architecture with a single codebase to a service-oriented architecture (SOA) with microservices. As the engineering team grew rapidly from 90 to over 1000 people, the monolith became difficult to scale and maintain. Airbnb defined design principles for building loosely coupled services and migrated functionality incrementally using a dual read comparison approach to validate services before switching traffic. This migration approach allowed Airbnb to successfully scale its engineering team while redesigning its technical architecture.
A Quick Start Guide to create Open Scholarly Communities on the WebFrancesca Di Donato
This document provides an introduction to the Quick Start Guide, which aims to help scholars create digital scholarly communities on the web. It is divided into three sections. Section 1 discusses choosing a corpus and involving a community. Section 2 covers the technical aspects of digitizing sources, preparing digital objects, setting up infrastructure, and using the Talia platform. Section 3 examines the legal, economic, and social frameworks for digital scholarly communities, including copyright issues for primary sources, facsimiles, and editions. The document seeks contributions to expand and improve the guide.
The economies of scaling software - Abdel Remanijaxconf
This document discusses quantifying the scalability of software. It recommends instrumenting code from the beginning to collect monitoring data on application health, the entire cluster, and individual nodes' system resources. This allows measuring how well a system can handle increasing load and evolving constraints.
Drupal and its contributed modules provides an impressive amount of functionality without needing to write a single line of code by storing information in Drupal’s database tables. Unfortunately this poses a challenge for developers wanting to stage changes between servers. This talk starts to address these issues by describing the problem and presenting a variety of solutions as well as their pros and cons. I also discuss some possible paths to make this easier coming down the pipe.
Serverless: when functions and GitOps collideEdward Wilde
Abstract
--------
This talk introduces OpenFaaS, an open source platform for serverless functions. We begin by installing, running the platform and invoking a machine learning function within 60 seconds (ish). Next we step up and look at the level of architecture and key pillars of OpenFaaS: developer first, operator friendly and community focused. We close by looking at a multiuser platform built on top OpenFaas.
The objective of this talk is openFaaS the serverless platform and OpenFaaS cloud the multi-user environment for practicing git ops with functions.
Bio
---
Edward Wilde is a core contributor to the OpenFaaS project and platform architect at Form3
Moving From Infrastructure Automation To True DevOpsXebiaLabs
The document discusses different approaches to implementing DevOps. It describes providing a platform where operations provides a runtime environment and applications are developed to run on that platform. It also describes integrated full-stack development teams where developers work across the entire development process. It emphasizes that there is no single right approach and organizations need to determine what works best for their context based on factors like organizational structure and software used. The goal of any approach should be quantifiable benefits like faster development and reduced costs.
A a sneak peek into PHP 7, Zend Server 9, Zend Studio 13.5, and what the combination brings to IBM i users. Available in Q3, sporting a new DB2 extension and lots of new goodies, Zend Server 9 will prove to be the biggest open source story to hit IBM i this year. Once you add the latest Zend Studio release (13.5), you’ll be well on your way to full web enablement.
But we're already open source! Why would I want to bring my code to Apache?gagravarr
From ApacheCon Europe 2015 in Budapest
So, your business has already opened sourced some of its code? Great! Or you're thinking about it? That's fine! But now, someone's asking you about giving it to these Apache people? What's up with that, and why isn't just being open source enough?
In this talk, we'll look at several real world examples of where companies have chosen to contribute their existing open source code to the Apache Software Foundation. We'll see the advantages they got from it, the problems they faced along the way, why they did it, and how it helped their business. We'll also look briefly at where it may not be the right fit.
Wondering about how to take your business's open source involvement to the next level, and if contributing to projects at the Apache Software Foundation will deliver RoI, then this is the talk for you!
The ""Apache Way"" is the process by which Apache Software Foundation projects are managed. It has evolved over many years and has produced over 100 highly successful open source projects. But what is it and how does it work?
This document discusses how taxonomy and content delivery can improve the customer experience across multiple touchpoints. It provides examples of how a shared taxonomy can enable faceted search and content personalization in a documentation portal, knowledge base, and customer community. The document also outlines use cases for intelligent context-sensitive help, content entitlements for specific user roles, and delivering personalized technical documentation to IoT devices. Best practices highlighted include implementing an agile taxonomy process, analyzing customer journeys, and using a dynamic publishing system to support all customer touchpoints.
War stories from building the Global Patent Search Network, and why Data folks need to think more about UX and Discovery, and UX folks need to think more about Data.
This document provides a case study on a project created using open source technology. It discusses analyzing project goals and resources, evaluating open source options based on total cost of ownership, implementing a solution using LAMP stack, and lessons learned. The project was developed using Linux, Apache, MySQL, and PHP based on the needs of a low budget, ability to invest in internal skills, and reduce dependency on external trends. Key steps included preparing the Linux server, using version control and local testing, and engaging the open source community for support.
Stop Building It From Scratch: Creating Reusable eLearning ComponentsChad Udell
Is your organization constantly going through the same development path to produce your e-Learning... rehashing the same code, digging through line after line of spaghetti, and not seeing any real reuse benefits? If you want to leverage your intellectual property, and put your toolset to work for you, come ready to take note of practical tools, tips, and techniques you can employ immediately to enjoy productivity benefits.
In this session, you’ll learn secrets that Rapid Development gurus use. If you are tired of simple tools that speed up development, but really tie your hands when it comes to true customization, you need to learn more about application programming interfaces (APIs). The Web 2.0 world has largely been shaped by the emergence of Web services and the dominance of XML, yet so few e Learning development programs allow you to tap into those powerful tools. You’ll leave this session with ideas on how to build your own APIs, or use preexisting ones, right away.
In this session, you will learn:
* The benefits of building content in a reusable format
* Practical examples of reusable e-Learning concepts
* How to apply object-oriented development techniques to e-Learning
* What APIs are, and how you can use them to speed up development
* Techniques for designing your own e-Learning APIs
* Designing data schemas for flexibility
Audience:
Advanced designers and developers with basic programming skills in ActionScript, and who have edited XML. Deeper understanding of XML concepts and terminology will assist in greater understanding of this topic.
Cassandra Day NY 2014: Message Architectures in Distributed Systems at Simple...DataStax Academy
Eric will be presenting on SimpleReach's use of message architectures and why they an important part of a distributed system stack. They are often overlooked because the prevailing sentiment is that the storage and processing engines are the most important aspects of the system. Without the highways, the data won’t be able to get to its destination.
Summit 16: The Practice of Building OPNFV User Group in ChinaOPNFV
The Practice of Building OPNFV User Group in China(Liang(Kubi) Gao) - when we talking about community, community is about relationship. We developing community by developing relationship. So building user group is an important thing for OPNFV Community. In this presentation, Liang Gao will review recent OPNFV meetups in China. and introduce their practice to build user group in China, including(meetup.com, WeChat group, WeChat Official Accounts and so on) ,and will show the plan of building user group in China.
Navigating the Incubator at the Apache Software FoundationBrett Porter
The document discusses navigating the Apache Incubator process for open source projects. It explains that the Incubator helps projects join the Apache Foundation by ensuring they meet legal standards and the collaborative development model. Projects go through an incubation phase where they build community, follow procedures, and release versions before attempting to graduate. Not all projects will graduate but the Incubator addresses important issues for projects seeking to join Apache.
Core Principles of CI was a presentation that covered:
1) The definition of continuous integration as automated builds that run many times a day.
2) The key aspects of CI including source code compilation, database integration, testing, inspection, deployment, feedback, and producing deployable software.
3) The benefits of CI like reduced human error, rapid feedback cycles, increased project visibility and confidence, and facilitating communication.
Keeping the Content Train on the Tracks (And on Topic)Kristen Eberlein
How do you keep your content moving forward, while you are weighed down with baggage from the past? How do you change course when the tracks ahead seem to have washed away? Come hear the editors of the DITA 1.3 specification talk about creative strategies they put into place to streamline the process of developing a version of the specification, improve the quality and usability of the content, and get to their destination on schedule.
[Annotated] QConSF 2018: Airbnb's Great Migration - From Monolith to Service-...Jessica Tai
Airbnb migrated from a monolithic architecture with a single codebase to a service-oriented architecture (SOA) with microservices. As the engineering team grew rapidly from 90 to over 1000 people, the monolith became difficult to scale and maintain. Airbnb defined design principles for building loosely coupled services and migrated functionality incrementally using a dual read comparison approach to validate services before switching traffic. This migration approach allowed Airbnb to successfully scale its engineering team while redesigning its technical architecture.
A Quick Start Guide to create Open Scholarly Communities on the WebFrancesca Di Donato
This document provides an introduction to the Quick Start Guide, which aims to help scholars create digital scholarly communities on the web. It is divided into three sections. Section 1 discusses choosing a corpus and involving a community. Section 2 covers the technical aspects of digitizing sources, preparing digital objects, setting up infrastructure, and using the Talia platform. Section 3 examines the legal, economic, and social frameworks for digital scholarly communities, including copyright issues for primary sources, facsimiles, and editions. The document seeks contributions to expand and improve the guide.
The economies of scaling software - Abdel Remanijaxconf
This document discusses quantifying the scalability of software. It recommends instrumenting code from the beginning to collect monitoring data on application health, the entire cluster, and individual nodes' system resources. This allows measuring how well a system can handle increasing load and evolving constraints.
Drupal and its contributed modules provides an impressive amount of functionality without needing to write a single line of code by storing information in Drupal’s database tables. Unfortunately this poses a challenge for developers wanting to stage changes between servers. This talk starts to address these issues by describing the problem and presenting a variety of solutions as well as their pros and cons. I also discuss some possible paths to make this easier coming down the pipe.
Serverless: when functions and GitOps collideEdward Wilde
Abstract
--------
This talk introduces OpenFaaS, an open source platform for serverless functions. We begin by installing, running the platform and invoking a machine learning function within 60 seconds (ish). Next we step up and look at the level of architecture and key pillars of OpenFaaS: developer first, operator friendly and community focused. We close by looking at a multiuser platform built on top OpenFaas.
The objective of this talk is openFaaS the serverless platform and OpenFaaS cloud the multi-user environment for practicing git ops with functions.
Bio
---
Edward Wilde is a core contributor to the OpenFaaS project and platform architect at Form3
Moving From Infrastructure Automation To True DevOpsXebiaLabs
The document discusses different approaches to implementing DevOps. It describes providing a platform where operations provides a runtime environment and applications are developed to run on that platform. It also describes integrated full-stack development teams where developers work across the entire development process. It emphasizes that there is no single right approach and organizations need to determine what works best for their context based on factors like organizational structure and software used. The goal of any approach should be quantifiable benefits like faster development and reduced costs.
A a sneak peek into PHP 7, Zend Server 9, Zend Studio 13.5, and what the combination brings to IBM i users. Available in Q3, sporting a new DB2 extension and lots of new goodies, Zend Server 9 will prove to be the biggest open source story to hit IBM i this year. Once you add the latest Zend Studio release (13.5), you’ll be well on your way to full web enablement.
But we're already open source! Why would I want to bring my code to Apache?gagravarr
From ApacheCon Europe 2015 in Budapest
So, your business has already opened sourced some of its code? Great! Or you're thinking about it? That's fine! But now, someone's asking you about giving it to these Apache people? What's up with that, and why isn't just being open source enough?
In this talk, we'll look at several real world examples of where companies have chosen to contribute their existing open source code to the Apache Software Foundation. We'll see the advantages they got from it, the problems they faced along the way, why they did it, and how it helped their business. We'll also look briefly at where it may not be the right fit.
Wondering about how to take your business's open source involvement to the next level, and if contributing to projects at the Apache Software Foundation will deliver RoI, then this is the talk for you!
The ""Apache Way"" is the process by which Apache Software Foundation projects are managed. It has evolved over many years and has produced over 100 highly successful open source projects. But what is it and how does it work?
Corel Draw es una de las herramientas más utilizadas para el diseño gráfico debido a que permite crear ilustraciones vectoriales y trabajar con imágenes, y ha ido incorporando aplicaciones como Photo Paint y Corel Trace. En la actualidad forma parte de un paquete más amplio llamado CorelDraw Graphics Suite que facilita la edición de fotografías, la conversión de imágenes raster a vectoriales y la creación de animaciones.
An overview of all the different content related technologies at the Apache Software Foundation
Talk from ApacheCon NA 2010 in Atlanta in November 2010
What's With The 1S And 0S? Making Sense Of Binary Data At Scale With Tika And...gagravarr
This document provides an overview of Apache Tika, an open source toolkit for detecting and extracting metadata and structured text from various file formats. It discusses Tika's capabilities for detecting file types using filename extensions, magic bytes, and parsing file containers. It also describes how Tika extracts metadata, plain text, and XHTML from files and supports detecting text encodings and languages. The document outlines different ways to extend and customize Tika, as well as various options for integrating and running Tika programs.
The Greater Cleveland Sports Commission (GCSC) is a non-profit organization that has attracted nearly 100 national sporting events to Cleveland with an economic impact of $300 million. It organizes several large youth tournaments annually through the Continental Cup, including soccer, baseball, basketball, and lacrosse. The Continental Cup has experienced rapid 500% growth since 2006 and now includes nearly 200 teams and over 3,000 athletes from around the world each year.
How Big is Big – Tall, Grande, Venti Data?gagravarr
Apache has a wide range of Big Data projects, some suitable for smaller problem sets, some which scale to huge problems. Today though, that one label "Big Data" can cause confusion for new users, as they may struggle to pick the right project for the right scale for their problem.
Do we need new titles for different kinds of Big Data? Does the buzz and VC funding cause confusion? Is the humble requirement dead? Or can we help new users better find the right Apache project for them?
If You Have The Content, Then Apache Has The Technology!gagravarr
This document provides a summary of 46 Apache content-related projects and 8 content-related incubating projects. It discusses projects for transforming and reading content like Apache PDFBox, POI, and Tika. Projects for text and language analysis like UIMA, OpenNLP, and Mahout. Projects that work with structured data and linked data like Any23, Stanbol, and Jena. Projects for data management and processing on Hadoop like MRQL, DataFu, and Falcon. Projects for serving content like HTTPD Server, TrafficServer, and Tomcat. Projects that focus on generating content like OpenOffice, Forrest, and Abdera. And projects for working with hosted content like Chemistry and ManifoldCF. The document
This document discusses visual thinking and provides guidance on mind mapping and visual note taking. It begins with an introduction slide about the presenter and an overview of topics to be covered, which include mind mapping and visual note taking. The presentation then provides instructions on how to create a mind map, highlighting key elements such as starting in the center, using branches, colors, images and keywords. The document encourages attendees to try mind mapping and visual note taking.
The other Apache Technologies your Big Data solution needsgagravarr
The document discusses many Apache projects relevant to big data solutions, including projects for loading and querying data like Pig and Gora, building MapReduce jobs like Avro and Thrift, cloud computing with LibCloud and DeltaCloud, and extracting information from unstructured data with Tika, UIMA, OpenNLP, and cTakes. It also mentions utility projects like Chemistry, JMeter, Commons, and ManifoldCF.
What's with the 1s and 0s? Making sense of binary data at scale - Berlin Buzz...gagravarr
If you have one or two files, you can take the time to manually work out what they are, what they contain, and how to get the useful bits out (probably....). However, this approach really doesn't scale, mechanical turks or no! Luckily, there are open source projects and libraries out there which can help, and which can scale!
In this talk, we'll first look at how we can work out what a given blob of 1s and 0s actually is, be it textual or binary. We'll then see how to extract common metadata from it, along with text, embedded resources, images, and maybe even the kitchen sink! We'll see how to use things like Apache Tika to do this, along with some other libraries to complement it. Once that part's all sorted, we'll look at how to roll this all out for a large-scale Search or Big Data setup, helping you turn those 1s and 0s into useful content at scale!
This talk was given at Berlin Buzzwords 2015
From the Fast Feather Track at ApacheCon NA 2010 in Atlanta
This quick talk provides an overview of Apache Tika, looks at a new features and supported file formats. It then shows how to create a new parser, and finishes with using Tika from your own application.
A presentation from ApacheCon Europe 2015 / Apache Big Data Europe 2015
Apache Tika detects and extracts metadata and text from a huge range of file formats and types. From Search to Big Data, single file to internet scale, if you've got files, Tika can help you get out useful information!
Apache Tika has been around for nearly 10 years now, and in that time, a lot has changed. Not only has the number of formats supported gone up and up, but the ways of using Tika have expanded, and some of the philosophies on the best way to handle things have altered with experience. Tika has gained support for a wide range of programming languages to, and more recently, Big-Data scale support, and ways to automatically compare effects of changes to the library.
Whether you're an old-hand with Tika looking to know what's hot or different, or someone new looking to learn more about the power of Tika, this talk will have something in it for you!
What's with the 1s and 0s? Making sense of binary data at scale with Tika and...gagravarr
If you have one or two files, you can take the time to manually work out what they are, what they contain, and how to get the useful bits out (probably....). However, this approach really doesn't scale, mechanical turks or no! Luckily, there are Apache projects out there which can help!
In this talk, we'll first look at how we can work out what a given blob of 1s and 0s actually is, be it textual or binary. We'll then see how to extract common metadata from it, along with text, embedded resources, images, and maybe even the kitchen sink! We'll see how to do all of this with Apache Tika, and how to dive down to the underlying libraries (including its Apache friends like POI and PDFBox) for specialist cases. Finally, we'll look a little bit about how to roll this all out on a Big Data or Large-Search case.
Créer une communauté open source: pourquoi ? comment ?Stefane Fermigier
The document discusses strategies for creating an open source community, including choosing an open source license, developing a website to promote the project and engage users, using tools like GitHub and StackOverflow to facilitate collaboration, and providing a clear roadmap and engaging with contributors to build and maintain an active developer community. It also provides recommendations for open source project maintainers, such as making contributions easy, thanking contributors, and distinguishing community support from sales interactions.
Intro to open source - 101 presentationJavier Perez
This document provides an overview of open-source software and how to get started with it. It discusses the history of open-source software dating back to 1955. It defines key open-source concepts like licenses, roles, and best practices for contributing. It also highlights the large open-source ecosystems existing today and the top companies contributing to open-source. The document aims to address common questions or concerns about open-source software.
This document provides an overview of contributing to open source software projects. It discusses what open source software is, how open source projects generally work, and offers advice on how to make a first contribution. Specifically, it recommends joining a project's mailing list, finding an issue to work on, making the code change along with any necessary tests or documentation, and submitting the contribution for review. It also uses the curl and Firefox projects as examples to illustrate different open source project structures and contribution processes.
Webinar: How to contribute to Apache Flink - Robert MetzgerVerverica
Robert Metzger will introduce you to the various ways of contributing to Apache Flink. We will start by an introduction into the Apache Software Foundation, its history and present-day status, and where Apache Flink fits within the Foundation. We’ll then give an overview of the different areas to contribute, and what to consider in those areas.
For example, how can I help users best on the user@flink mailing list, how are decisions being made in the community, how can I help with Flink releases or how can I contribute code to the project.
How to Contribute to Apache Flink (and Flink at the Apache Software Foundation)Robert Metzger
This document discusses how to contribute to the Apache Flink project. It provides an overview of the Apache Software Foundation and Flink's role within it. It describes the various roles within Apache projects like Flink, including committers, PMC members, and contributors. It outlines ways to contribute such as through user support, documentation, code contributions, and reviews. The document emphasizes that there are many paths to contribute and all contributions are welcome.
The Apache Way: A Proven Way Toward SuccessEvans Ye
With innumerous successful Apache projects that dominate the big data world, the working model of Apache communities clearly deserved a study. In this talk, I'll walk you through how Apache communities and the Apache Software Foundation work generally. The whole thing behinds it is so called "The Apache Way".
For audience whose an engineer, I'll share with you why you should be part of the Apache family, how to do it, and what you can get from it. Moreover, I'll cover this with some actionable tips, and closing up with some career advices. For those being managers or at CXO level, I'll talk about some aspects on building engineering culture which can alternately pace your team and business toward success.
This document provides an overview of open source software. It defines open source as software where the source code is publicly available and can be modified and shared by anyone. The open source movement began in the 1980s with free software and the term open source was coined in 1998. Open source uses various licenses to grant users freedom to use, modify, and share software. It offers advantages like lower costs, transparency, and collaborative development but also challenges in business models and quality assurance. Common development tools include version control and mailing lists while popular open source products include Linux, Firefox, and Android.
Lipstick on a Pig: Integrated Library SystemsDorothea Salo
This document discusses communication strategies for library colleagues, including writing concisely and focusing on the purpose and audience. It also summarizes tools like Creative Commons licensing and maintaining an online presence. Finally, it examines software development models and challenges facing library systems, including integrating library data and collections onto the open web.
How and Why you can and should Participate in Open Source Projects (AMIS, Sof...Lucas Jellema
For a long time I have been reluctant to actively contribute to an open source project. I thought it would be rather complicated and demanding – and that I didn't have the knowledge or skills for it or at the very least that they (the project team) weren't waiting for me.
In December 2021, I decided to have a serious input into the Dapr.io project – and now finally to determine how it works and whether it is really that complicated. In this session I want to tell you about my experiences. How Fork, Clone, Branch, Push (and PR) is the rhythm of contributing to an open source project and how you do that (these are all Git actions against GitHub repositories). How to learn how such a project functions and how to connect to it; which tools are needed, which communication channels are used. I tell how the standards of the project – largely automatically enforced – help me to become a better software engineer, with an eye for readability and testability of the code.
How the review process is quite exciting once you have offered your contribution. And how the final "merge to master" of my contribution and then the actual release (Dapr 1.6 contains my first contribution) are nice milestones.
I hope to motivate participants in this session to also take the step yourself and contribute to an open source project in the form of issues or samples, documentation or code. It's valuable to the community and the specific project and I think it's definitely a valuable experience for the "contributer". I looked up to it and now that I've done it gives me confidence – and it tastes like more (I could still use some help with the work on Dapr.io, by the way).
The document discusses the concept of "Docs Like Code", which treats documentation like code by storing docs in version control systems, using plain text formats, and integrating doc writing and publishing into the same workflow as software development. It provides the case study of Apache Pulsar, which uses GitHub and other tools to collaborate effectively on docs between developers, writers and users. Benefits include better doc quality and syncing with code through continuous integration/deployment of docs.
Presentation mainly deals with Open Source and how Os projects work? Who does it? Why they do it? Why you should contribute to Open Source? Different ways of contribution.
[Workshop] Building an Integration Agile Digital Enterprise with Open Source ...WSO2
This document provides an overview of open source software. It discusses why organizations use open source software, noting benefits like more control over the software, increased security, support for interoperability, and guaranteed future development. It also covers the differences between free and open source software. The document outlines several open source foundations and their major projects. It explores open source philosophies like community over code and the cathedral and bazaar models of development. Finally, it addresses understanding open source infrastructure like mailing lists, version control, issue trackers, wikis, documentation, and websites.
Pharo Consortium: A roadmap to solid evolutionESUG
Title: Pharo Consortium: A roadmap to solid evolution.
Type:
Abstract: The Pharo Consortium takes action to guarantee the growth of Pharo as a community organisation as well as supports its development into ever-changing requirements of today’s computing needs.
In this talk I will present the current status of Pharo Consortium, its consolidation as Pharo governance structure and the actions taken to make Pharo a more robust environment.
I will also review the Pharo 7 development and direction for Pharo 8 and beyond.
Bio: Esteban Lorenzano studied Computer Sciences at Universidad de Buenos Aires, and worked since 1994 in several object-oriented and low-level technologies, in different software companies, serving in various positions from programmer to senior architect.
In 2007 he co-founded Smallworks to offer Pharo-based agile development projects. Since 2012 he dedicated full time to developing the Pharo code and community.
He works with the INRIA-RMoD team in Lille, France, as core developer for Pharo, being responsible with the coordination of new releases, the implementation and maintenance of Pharo libraries and the maintenance of the Pharo flavour of the Cog Virtual Machine, FFI integration and plugins in all major platforms (OSX, Linux and Windows).
The roadmap of Pharo 7 and Pharo 8.
Pharo is a pure object-oriented programming language and a powerful environment, focused on simplicity and immediate feedback (think IDE and OS rolled into one).
The Open Source Geospatial Foundation does much more than hold FOSS4G each year.
This talk will look into what makes OSGeo a software foundation. What software foundations have to offer members, software projects and developers.
This talk is structured around the “incubation” process by which new software projects join the OSGeo.
If you are new to open source take this is a great chance to see how OSGeo evaluates software projects and how these checks protect you!
For managers it is especially important to understand the risks associated with the use of open source. Understand what assurances OSGeo incubation offers, how to double check the results, and what factors are left for your own risk assessment.
If you are a developer considering getting involved in OSGeo this is great talk to learn what is involved, how much work it will be, and how you can start!
Come see what makes OSGeo more than a user group!
Customer Success is a core value at Alfresco, and for this we have invested in people, processes and tools to maximize the possibility of our Partners and Customers succeeding in Alfresco Projects. In this session, Gab will share his 5+ years experience on running successful Alfresco projects, providing a comprehensive but practical set of recommendations, to overcome common business & technical hurdles of an Enterprise wide ECM implementation and ensure continued success for your projects.
This preso also introduces the first pilot of Alfresco Developer Support service, geared towards helping customers and partners developers in developing high quality, scalable and high longevity Alfresco customizations, as well as to help you support those customizations in production.
Leaping the chasm from proprietary to open: A survivor's guidebcantrill
This document summarizes the history of open source software and the challenges of transitioning proprietary software to open source. It discusses how Sun open sourced Solaris but Oracle later closed it, leading to the illumos community. It also describes how Joyent open sourced its SmartDataCenter and Manta software after initially being proprietary, noting the business reasons and technical challenges involved in such a transition. The document advocates for permissive licensing and avoiding strong governance for most open source projects.
Similar to But We're Already Open Source! Why Would I Want To Bring My Code To Apache? (20)
Digital Banking in the Cloud: How Citizens Bank Unlocked Their MainframePrecisely
Inconsistent user experience and siloed data, high costs, and changing customer expectations – Citizens Bank was experiencing these challenges while it was attempting to deliver a superior digital banking experience for its clients. Its core banking applications run on the mainframe and Citizens was using legacy utilities to get the critical mainframe data to feed customer-facing channels, like call centers, web, and mobile. Ultimately, this led to higher operating costs (MIPS), delayed response times, and longer time to market.
Ever-changing customer expectations demand more modern digital experiences, and the bank needed to find a solution that could provide real-time data to its customer channels with low latency and operating costs. Join this session to learn how Citizens is leveraging Precisely to replicate mainframe data to its customer channels and deliver on their “modern digital bank” experiences.
In the realm of cybersecurity, offensive security practices act as a critical shield. By simulating real-world attacks in a controlled environment, these techniques expose vulnerabilities before malicious actors can exploit them. This proactive approach allows manufacturers to identify and fix weaknesses, significantly enhancing system security.
This presentation delves into the development of a system designed to mimic Galileo's Open Service signal using software-defined radio (SDR) technology. We'll begin with a foundational overview of both Global Navigation Satellite Systems (GNSS) and the intricacies of digital signal processing.
The presentation culminates in a live demonstration. We'll showcase the manipulation of Galileo's Open Service pilot signal, simulating an attack on various software and hardware systems. This practical demonstration serves to highlight the potential consequences of unaddressed vulnerabilities, emphasizing the importance of offensive security practices in safeguarding critical infrastructure.
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...Alex Pruden
Folding is a recent technique for building efficient recursive SNARKs. Several elegant folding protocols have been proposed, such as Nova, Supernova, Hypernova, Protostar, and others. However, all of them rely on an additively homomorphic commitment scheme based on discrete log, and are therefore not post-quantum secure. In this work we present LatticeFold, the first lattice-based folding protocol based on the Module SIS problem. This folding protocol naturally leads to an efficient recursive lattice-based SNARK and an efficient PCD scheme. LatticeFold supports folding low-degree relations, such as R1CS, as well as high-degree relations, such as CCS. The key challenge is to construct a secure folding protocol that works with the Ajtai commitment scheme. The difficulty, is ensuring that extracted witnesses are low norm through many rounds of folding. We present a novel technique using the sumcheck protocol to ensure that extracted witnesses are always low norm no matter how many rounds of folding are used. Our evaluation of the final proof system suggests that it is as performant as Hypernova, while providing post-quantum security.
Paper Link: https://eprint.iacr.org/2024/257
The Microsoft 365 Migration Tutorial For Beginner.pptxoperationspcvita
This presentation will help you understand the power of Microsoft 365. However, we have mentioned every productivity app included in Office 365. Additionally, we have suggested the migration situation related to Office 365 and how we can help you.
You can also read: https://www.systoolsgroup.com/updates/office-365-tenant-to-tenant-migration-step-by-step-complete-guide/
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving
Manufacturing custom quality metal nameplates and badges involves several standard operations. Processes include sheet prep, lithography, screening, coating, punch press and inspection. All decoration is completed in the flat sheet with adhesive and tooling operations following. The possibilities for creating unique durable nameplates are endless. How will you create your brand identity? We can help!
What is an RPA CoE? Session 1 – CoE VisionDianaGray10
In the first session, we will review the organization's vision and how this has an impact on the COE Structure.
Topics covered:
• The role of a steering committee
• How do the organization’s priorities determine CoE Structure?
Speaker:
Chris Bolin, Senior Intelligent Automation Architect Anika Systems
Essentials of Automations: Exploring Attributes & Automation ParametersSafe Software
Building automations in FME Flow can save time, money, and help businesses scale by eliminating data silos and providing data to stakeholders in real-time. One essential component to orchestrating complex automations is the use of attributes & automation parameters (both formerly known as “keys”). In fact, it’s unlikely you’ll ever build an Automation without using these components, but what exactly are they?
Attributes & automation parameters enable the automation author to pass data values from one automation component to the next. During this webinar, our FME Flow Specialists will cover leveraging the three types of these output attributes & parameters in FME Flow: Event, Custom, and Automation. As a bonus, they’ll also be making use of the Split-Merge Block functionality.
You’ll leave this webinar with a better understanding of how to maximize the potential of automations by making use of attributes & automation parameters, with the ultimate goal of setting your enterprise integration workflows up on autopilot.
5th LF Energy Power Grid Model Meet-up SlidesDanBrown980551
5th Power Grid Model Meet-up
It is with great pleasure that we extend to you an invitation to the 5th Power Grid Model Meet-up, scheduled for 6th June 2024. This event will adopt a hybrid format, allowing participants to join us either through an online Mircosoft Teams session or in person at TU/e located at Den Dolech 2, Eindhoven, Netherlands. The meet-up will be hosted by Eindhoven University of Technology (TU/e), a research university specializing in engineering science & technology.
Power Grid Model
The global energy transition is placing new and unprecedented demands on Distribution System Operators (DSOs). Alongside upgrades to grid capacity, processes such as digitization, capacity optimization, and congestion management are becoming vital for delivering reliable services.
Power Grid Model is an open source project from Linux Foundation Energy and provides a calculation engine that is increasingly essential for DSOs. It offers a standards-based foundation enabling real-time power systems analysis, simulations of electrical power grids, and sophisticated what-if analysis. In addition, it enables in-depth studies and analysis of the electrical power grid’s behavior and performance. This comprehensive model incorporates essential factors such as power generation capacity, electrical losses, voltage levels, power flows, and system stability.
Power Grid Model is currently being applied in a wide variety of use cases, including grid planning, expansion, reliability, and congestion studies. It can also help in analyzing the impact of renewable energy integration, assessing the effects of disturbances or faults, and developing strategies for grid control and optimization.
What to expect
For the upcoming meetup we are organizing, we have an exciting lineup of activities planned:
-Insightful presentations covering two practical applications of the Power Grid Model.
-An update on the latest advancements in Power Grid -Model technology during the first and second quarters of 2024.
-An interactive brainstorming session to discuss and propose new feature requests.
-An opportunity to connect with fellow Power Grid Model enthusiasts and users.
Driving Business Innovation: Latest Generative AI Advancements & Success StorySafe Software
Are you ready to revolutionize how you handle data? Join us for a webinar where we’ll bring you up to speed with the latest advancements in Generative AI technology and discover how leveraging FME with tools from giants like Google Gemini, Amazon, and Microsoft OpenAI can supercharge your workflow efficiency.
During the hour, we’ll take you through:
Guest Speaker Segment with Hannah Barrington: Dive into the world of dynamic real estate marketing with Hannah, the Marketing Manager at Workspace Group. Hear firsthand how their team generates engaging descriptions for thousands of office units by integrating diverse data sources—from PDF floorplans to web pages—using FME transformers, like OpenAIVisionConnector and AnthropicVisionConnector. This use case will show you how GenAI can streamline content creation for marketing across the board.
Ollama Use Case: Learn how Scenario Specialist Dmitri Bagh has utilized Ollama within FME to input data, create custom models, and enhance security protocols. This segment will include demos to illustrate the full capabilities of FME in AI-driven processes.
Custom AI Models: Discover how to leverage FME to build personalized AI models using your data. Whether it’s populating a model with local data for added security or integrating public AI tools, find out how FME facilitates a versatile and secure approach to AI.
We’ll wrap up with a live Q&A session where you can engage with our experts on your specific use cases, and learn more about optimizing your data workflows with AI.
This webinar is ideal for professionals seeking to harness the power of AI within their data management systems while ensuring high levels of customization and security. Whether you're a novice or an expert, gain actionable insights and strategies to elevate your data processes. Join us to see how FME and AI can revolutionize how you work with data!
Your One-Stop Shop for Python Success: Top 10 US Python Development Providersakankshawande
Simplify your search for a reliable Python development partner! This list presents the top 10 trusted US providers offering comprehensive Python development services, ensuring your project's success from conception to completion.
For the full video of this presentation, please visit: https://www.edge-ai-vision.com/2024/06/temporal-event-neural-networks-a-more-efficient-alternative-to-the-transformer-a-presentation-from-brainchip/
Chris Jones, Director of Product Management at BrainChip , presents the “Temporal Event Neural Networks: A More Efficient Alternative to the Transformer” tutorial at the May 2024 Embedded Vision Summit.
The expansion of AI services necessitates enhanced computational capabilities on edge devices. Temporal Event Neural Networks (TENNs), developed by BrainChip, represent a novel and highly efficient state-space network. TENNs demonstrate exceptional proficiency in handling multi-dimensional streaming data, facilitating advancements in object detection, action recognition, speech enhancement and language model/sequence generation. Through the utilization of polynomial-based continuous convolutions, TENNs streamline models, expedite training processes and significantly diminish memory requirements, achieving notable reductions of up to 50x in parameters and 5,000x in energy consumption compared to prevailing methodologies like transformers.
Integration with BrainChip’s Akida neuromorphic hardware IP further enhances TENNs’ capabilities, enabling the realization of highly capable, portable and passively cooled edge devices. This presentation delves into the technical innovations underlying TENNs, presents real-world benchmarks, and elucidates how this cutting-edge approach is positioned to revolutionize edge AI across diverse applications.
Ivanti’s Patch Tuesday breakdown goes beyond patching your applications and brings you the intelligence and guidance needed to prioritize where to focus your attention first. Catch early analysis on our Ivanti blog, then join industry expert Chris Goettl for the Patch Tuesday Webinar Event. There we’ll do a deep dive into each of the bulletins and give guidance on the risks associated with the newly-identified vulnerabilities.
HCL Notes and Domino License Cost Reduction in the World of DLAUpanagenda
Webinar Recording: https://www.panagenda.com/webinars/hcl-notes-and-domino-license-cost-reduction-in-the-world-of-dlau/
The introduction of DLAU and the CCB & CCX licensing model caused quite a stir in the HCL community. As a Notes and Domino customer, you may have faced challenges with unexpected user counts and license costs. You probably have questions on how this new licensing approach works and how to benefit from it. Most importantly, you likely have budget constraints and want to save money where possible. Don’t worry, we can help with all of this!
We’ll show you how to fix common misconfigurations that cause higher-than-expected user counts, and how to identify accounts which you can deactivate to save money. There are also frequent patterns that can cause unnecessary cost, like using a person document instead of a mail-in for shared mailboxes. We’ll provide examples and solutions for those as well. And naturally we’ll explain the new licensing model.
Join HCL Ambassador Marc Thomas in this webinar with a special guest appearance from Franz Walder. It will give you the tools and know-how to stay on top of what is going on with Domino licensing. You will be able lower your cost through an optimized configuration and keep it low going forward.
These topics will be covered
- Reducing license cost by finding and fixing misconfigurations and superfluous accounts
- How do CCB and CCX licenses really work?
- Understanding the DLAU tool and how to best utilize it
- Tips for common problem areas, like team mailboxes, functional/test users, etc
- Practical examples and best practices to implement right away
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUpanagenda
Webinar Recording: https://www.panagenda.com/webinars/hcl-notes-und-domino-lizenzkostenreduzierung-in-der-welt-von-dlau/
DLAU und die Lizenzen nach dem CCB- und CCX-Modell sind für viele in der HCL-Community seit letztem Jahr ein heißes Thema. Als Notes- oder Domino-Kunde haben Sie vielleicht mit unerwartet hohen Benutzerzahlen und Lizenzgebühren zu kämpfen. Sie fragen sich vielleicht, wie diese neue Art der Lizenzierung funktioniert und welchen Nutzen sie Ihnen bringt. Vor allem wollen Sie sicherlich Ihr Budget einhalten und Kosten sparen, wo immer möglich. Das verstehen wir und wir möchten Ihnen dabei helfen!
Wir erklären Ihnen, wie Sie häufige Konfigurationsprobleme lösen können, die dazu führen können, dass mehr Benutzer gezählt werden als nötig, und wie Sie überflüssige oder ungenutzte Konten identifizieren und entfernen können, um Geld zu sparen. Es gibt auch einige Ansätze, die zu unnötigen Ausgaben führen können, z. B. wenn ein Personendokument anstelle eines Mail-Ins für geteilte Mailboxen verwendet wird. Wir zeigen Ihnen solche Fälle und deren Lösungen. Und natürlich erklären wir Ihnen das neue Lizenzmodell.
Nehmen Sie an diesem Webinar teil, bei dem HCL-Ambassador Marc Thomas und Gastredner Franz Walder Ihnen diese neue Welt näherbringen. Es vermittelt Ihnen die Tools und das Know-how, um den Überblick zu bewahren. Sie werden in der Lage sein, Ihre Kosten durch eine optimierte Domino-Konfiguration zu reduzieren und auch in Zukunft gering zu halten.
Diese Themen werden behandelt
- Reduzierung der Lizenzkosten durch Auffinden und Beheben von Fehlkonfigurationen und überflüssigen Konten
- Wie funktionieren CCB- und CCX-Lizenzen wirklich?
- Verstehen des DLAU-Tools und wie man es am besten nutzt
- Tipps für häufige Problembereiche, wie z. B. Team-Postfächer, Funktions-/Testbenutzer usw.
- Praxisbeispiele und Best Practices zum sofortigen Umsetzen
3. Open, Open and Open!
• Open vs Open Source
• The ASF – What's the deal?
• Case Study 1 – Alfresco and Apache Tika
• Case Study 2 – Alfresco and Apache Chemistry
• Case Study 3 – Alfresco and Activiti
• The Benefits
• And the downsides...
5. Open Source
“denoting software for which the original source code is made
freely available and may be redistributed and modified”
• You can get the source behind a program (if you want)
• You can study it, learn from it, understand it, debug it
• You can make changes to it (or pay someone else to)
• You can use your new version without having to pay
• You can share your changes and improvements
• There is a license, but it's a distribution not use one
6. Open Development
• Open Source is all about the software today
• Open Development is about the software tomorrow
• Quite possible to be Open Source but Closed Development
• Who decides what the priorities are?
• Where do those discussions happen, and are they shared?
• Who can influence the direction of the project?
• Who can contribute (directly or indirectly)?
• Who dictates releases, “done” etc?
7. Copyright Assignments
• Applies when you want to contribute a fix or enhancement
• Assignement requires you sign over your rights to your
changes to another entity, to do with as they wish
• Most common with Commercial+CopyLeft
• Can be used “for good”, to permit relicensing in future, and
to allow a central entity to fight infringement claims
• Can be used “for money”, to allow a company to license out
your changes to their enterprise customers
• Typically very divisive, problems if not accepted
8. CLAs (License Agreements)
• At first glance, these look quite similar
• Legal agreement about a contribution
• But... Very different kettle of fish!
• Copyright Assignment requires you hand over your rights
• CLAs are about ensuring you understand the open source
license, and are allowed to contribute under it
• “License says you can have this, and I confirm you can”
• Paper trail of contributions
• Defends against “but that's our code!”
9. Community Direction
• Within any project, whether open or closed, proprietary or
shared, there's a process to decide on direction
• When will the next release be considered “done”?
• What features should be worked on? Not worked on?
• What's the vision for the project? What's out of scope?
• Who can join in? Who can contribute?
• How are new changes reviewed+accepted? And by who?
• How are conflicts and debates handled? Who has a say?
• How can this be changed over time?
10. Licensing
• All Open Source licenses are distribution licenses
• Use of the software is free (support + add-ons may not be!)
• To broad families of Open Source licenses
• Copyleft (eg GNU-GPL, LGPL)
• Permissive (eg Apache License, BSD, MIT)
• Strong philosophical differences divide, plenty of license
flame-wars online debating all of these!
• Have different restrictions on building on top of them
• Influences the business models you can build on top
11. Many Models
• There isn't a “One True Way”
• But the ASF has a “often least worst way!”
• More on the ASF shortly
• More on pluses and minuses later
• No one model is “always right” or “always wrong”, need to
weigh up the benefits and downsides
• There is help for weighing this all up!
13. What is the ASF?
• US 501(c)3 not-for-profit foundation
• Founded in 1999 with one project (webserver)
• Today has over 150!
• Meritocratic, community driven open source
• Anyone can get involved, and not just as a coder
• Decisions taken by the community
• Work done by volunteers
• Open Development, friendly to business involvement
14. Driven by projects
• A number of projects
• Each project is responsible for their own code, community and direction
• Board provides oversight, but that's it
• Board has no say on what code gets written, nor what direction projects
take, nor what projects we should start. All of that is devolved to the
projects themselves
• Foundation has some common support (eg infra, press, trademarks), to
help projects focus on their code and on their communities
17. Alfresco
• Open Source Enterprise Content Management
• >1 million active users, >4 million downloads, plus Cloud
• Open Source, but not Open Development (mostly)
• Written mostly in Java and JavaScript
• ECM – Capture, Manage, Store, Preserve, Deliver, Search, Collaborate,
within and between organisations
• Includes Document Management, Records Management, Asset
Management, WCM, Workflow etc
• Alfresco is Open Source, and builds on lots of of Open Source projects,
especially from Apache and Spring
18. Metadata & Transformers
• Content Transformers and Metadata Extractors are core Alfresco
services, present from the very start
• Transformers are used for full text indexing, web previews, icons,
web friendly versions etc
• Metadata Extraction used to synchronise the document properties
with the repository
• Both normally used with uploaded binary files
• eg upload a Word Document, text extracted for indexing, PDF
version generated, title + author extracted
19. Alfresco 3.3 - Formats
• PDF
• Word, PowerPoint, Excel
• HTML
• Open Document Formats (OpenOffice)
• RFC822 Email
• Outlook .msg Email
•
• And that's it!
20. Apache Tika tika.apache.org
• Apache Project, started in 2006
• Initially grew out of Lucene, but now very widely used
• Provides detection capabilities – eg this is a Word Doc
• Parsers for a very wide range of formats
• Extracts metadata, and provides a consistent model, eg created by
vs author
• Textual content available as Plain Text and XHTML
• Hides complexity of file formats and librarys, presents a simple and
powerful API across all of them
• Easy to use and extend
21. Sharing process
• Identify upstream projects (mostly Apache Tika + POI)
• Get the latest versions of the code
• Compared Alfresco code to upstream, and identify areas where
Alfresco did more / upstream didn't handle
• Wrote unit tests for things Alfresco did better, to avoid regressions
after the push upstream
• Identified which bits of upstream would be improved, and what bits
would be whole new features
• Worked with upstream projects, shared ideas for enhancements,
checked hadn't missed things
22. Sharing process continued
• Produced patches, submitted to the Apache projects
• Get feedback and improvements on patches
• Gained trust of communities, granted commit rights
•
• Once most things upstream, update Tika and POI versions used in
Alfresco, check everything worked
• Wrote Tika wrappers for Alfresco
• Converted Alfresco code to call Tika
• Used unit tests to ensure no lost functionality
• Enabled other formats via Tika
23. Alfresco 4.2 - Formats
• Audio – WAV, RIFF, MIDI, MP3, MP4, Ogg Vorbis, FLAC
• CAD – DWG, PRT
• EPub
• Feeds – RSS, Atom
• HTML, XHTML, XML
• Images – JPEG, GIF, PNG, TIFF, Bitmap
• Incuding EXIF data where present
• RFC-822 MBox email
• Microsoft Outlook .msg email
24. Alfresco 4.2 - Formats
• IWorks (Keynote, Pages, Numbers)
• Microsoft Office (Binary) – Word, PowerPoint, Excel, Visio,
Publisher, Works
• Microsoft Office (OOXML) – Word, PowerPoint, Excel
• Open Document Format (OpenOffice)
• PDF
• RTF
• Plain Text
• Scientific Data – CDF, HDF
25. Alfresco 4.2 - Formats
• Archive - Zip, Tar, Tar-GZ, Tar-BZ2, Compress, Ar
• FLV Video
• MP4 Video
• Java Class Files
• CHM (Windows Help)
• Executable Libraries and Programs
• Configurable external programs
•
And probably some others I've forgotten
26. Alfresco 5.0 – New Formats
Thus far, Alfresco 5.0 is set to get the following new formats, on
release, at no extra effort beyond upgrading Apache Tika:
• Archive - 7z
• Audio – Ogg Opus and Speex
• Video – Ogg Theora
• Image – Tesseract OCR support
• Source Code
• Email – Outlook PST
• Font – Adobe Font Metrics
And maybe more before 5.0 Enterpise Final is released!
28. CMIS (OASIS Standard)
• CMIS – Content Management Interoperability Services
• Standard for talking to content repositories
• Provides a standard way to login, navigate, search, upload, version,
delete, change permissions, alter properties, set relationships,
query etc
• Supported by a wide range of content repositories
• Adobe, IBM, Microsoft, Alfresco, SAP, EMC, HP, Liferay, Nuxeo,
OpenText, Oracle, and some others
• CMIS 1.0 released in 2010
• CMIS 1.1 released in late 2012
29. First Alfresco Support
• Alfresco was involved in CMIS from an early stage
• First Alfresco CMIS support was added before the CMIS 1.0 spec
was completed, and was used to feed back on the proposed
standard
• First support was developed internally, took quite a while, and
turned out to have some issues
• Desire for a pure-Java server implementation, which would be more
rhobust and easier to test + validate
• Desire for a CMIS client for a few different platforms
• Desire not to have to do it all in-house
30. Two Rival Camps
• Alfresco, OpenText and SAP started working on an open source
Java client and server as OpenCMIS
• Nuxeo, Adobe and friends started on Apache Chemistry
• Both had strengths and weaknesses
• Quite different development and governance approaches
• Some lack of trust between camps
• Quite a lot of misunderstandings about what Apache and the
Apache Way was, and worry about a loss of control or direction,
from some in OpenCMIS
• Then an idea to contribute OpenCMIS as another project
31. The Joining Together
• Lots of debate on mailing lists and in private
• Eventually a get-together in Munich, coupled with lots of feedback
to the mailing list, and rattification there
• Outcome was a merged project, Apache Chemistry, which
combined the best of both
• Lots of activity, lead quite quickly to a CMIS-1.0 compatible test
server, server framework, client and test suite
• Took some work to implement this new server into Alfresco, but it
proved better than the in-house one
• Initially only a handful of companies used it, rest tested
32. Going Forward
• Alfresco received lots of bug fixes “for free” just from upgrading the
OpenCMIS libraries
• More companies started basing their CMIS implementations on
Apache Chemistry, having realised that it was both high quality
and already available
• Virtuous circle of enhancements and fixes
•
• CMIS 1.1 support added to Apache Chemistry mostly by SAP, with
help from others
• Adding CMIS 1.1 support to Alfresco took a tiny fraction of the time
CMIS 1.0 did, due to Chemistry support
34. BPMN-2.0 Workflows
• BPMN – Business Process Modeling Notation
• Way of describing business processes (workflows), which can be
displayed visually, but also executed
• Covers most of the key elements of a workflow
• Ideally can be created by business analysts
• Big step on from previous BPMN versions is that the format is
executable – the same thing that the analyst creates / changes
is what gets run
• Industry standard for doing this (previously each engine had its own
way of doing that)
35. Alfresco and Workflow
• Alfresco Workflows were based on jBPM
• In 2010, jBMP was LGPL and owned by Redhat
• Alfresco were finding some problems with jBPM that were hard to
fix, and seemed to be architectural
• jBPM core devs were interested in a re-write, Redhat less so
• Alfresco was trying to reduce the number of copyleft dependencies
in the codebase, jBMP was a big part
• Alfresco had moved from Hibernate to iBatis, and jBMP remainging
with Hibernate was making development + support harder
36. The Birth of Activiti
• Alfresco hired two of the core jBPM developers, Tom Baeyens and
Joram Barrez, plus some others
• Charged with developing an Apache licensed, open source
implementation of BPMN 2.0
• Alfresco was the be the “launch customer”, but being open source
anyone can (and hopefully would) use it too
• Needed some changes to Alfresco to make the the Workflow APIs
(already pluggable) work well with Activiti as well as jBMP
• Aim was for Activiti to be the default engine, and more powerful /
richer than jBPM was
37. ASL v2, but not ASF
• License was picked as Apache License v2
• However, after lots of internal debate, decision was made not to
submit the project to the Apache Incubator
• Alfresco wanted their next release to ship with Activiti, meant they
had strong pressures to hit a ship date
• Initial feature set largely dominated by what was needed for
Alfresco, other features were added but mostly after the first
release
• Tom Baeyens is a “BPM-God”, really knows his stuff, but after
Redhat frustrations wanted to drive it
• Initially not good Incubator fit, later why change?
39. Shared Maintenance
• By sharing upstream, your support overhead is shared
• Many companies, organisations and individuals can join in and
contribute towards the project
• When others extend your project, you benefit for free
• More users means greater chance of community enhancements /
new features
• More users means more bug reports, but also more bug reports
that come with fixes (patches)!
• Lower per-organisation maintenance costs
• Often faster development
40. Better Bug Reports
• People familiar with Open Source often know how to report bugs
better
• By sharing your code, more people can use it, so there's a wider pool
of people to test and report on it
• “I have found a bug, but I can't share the details of it” is less
common, and more chances of a duplicate which can
• More likely that another user can come along and fill in the details of
existing bugs
• If you're open and other companies are joining in, greater chance
someone will have dedicated testers!
41. New Ideas, New Features
• When other companies feel they can base their business on your
project, then they can dedicate more resources
• When individuals feel they have a say, they're more likely to spend
their time working on it
• More developers means more ideas for fixing problems, enhancing
exisiting code, and adding new things
• When contributions are felt to be welomed and appreciated, more
likely for people to share back
• Can be small changes over time, or can be large new features, all
for (almost) free!
42. Learning from the best
• The world is a very big place
• No matter how great you are, there's someone out there who's
smarter than you are!
• And many more people who've been there before
•
• If you have more contributors, there's more chance of someone
improving your idea or code
• By seeing how they do it better, you not only get a better project,
but learn from it to improve yourself
• Collaborate to get the best of everyone
43. Diverse Users and Devs
• We are often a bit of a mono-culture
• Some worse than others.....
•
• If you draw from a very small pool, you'll have people who tend to
think the same, and will be blinded by that
• If you have people from all around the world, from all walks of life,
you've more chance of diverse opinions
• Best time to fix a mistake is at the start
• Even if it's just “hmm, that breaks when you're not +00:00” !
44. Community and Goodwill
• Sharing and being open can bring positive press
• Your organisation can gain recognition from their involvement
• Halo effect – share in the buzz from the project
• And an open project with lots of people has more buzz!
•
• If you're a known good team player, it's easier to call in favours from
the community (eg critical bug fix)
• If you're known generally as a good team player, easier to get help or
support in related projects (eg dependencies)
46. Lack of Control
• If it's your project, you can do with it as you want
• Very easy to make decisions, get things done
• If it's an ASF project, you need to explain your ideas, and convince
the rest of the community of why
• If it's in-house, you can have a quick chat to decide
• If it's an ASF project, you need to write it all down, explain it, and wait
to allow everyone to have a chance to review
• Management understand traditional projects
• Can lead to confusion and hard questions if not properly explained,
expect some “what do you mean?”s
47. Release dates
• If it's your project, you have full control (and responsibility) about
“done”, and the associated release dates
• If it's an ASF project, it's a community decision
• You can't “demand” that someone else fixes a bug
• If the community is all excited and working on a new feature, it may
not be the right time for a stable release
• If you're on a different cycle to many other contributors, expect to
still have to do much of the “boring bits”
• Need to get the community to share release vision
48. Direction / Features
• If it's your project, you decide what's in, you direct the developers,
off they go (or fail trying....)
• If it's an ASF project, you need to explain the idea to the community,
and get their buy-in
• You may still then have to do most of the work on it – you can't
force the volunteers to work on something!
• Your vision for where it should go may not match others
• Your “value add” might end up getting re-done by the community,
and you can't stop that
• Can confuse management, who expect a different dynamic about
getting new things in
49. Marketing & Branding
• If it's your project, you can brand it how you want, associate your
company with it how you want
• ASF has trademark rules which apply, you mustn't confuse your
product with the project
• Need to co-ordinate with the rest of the community on branding,
press releases, conferences etc
• Can be hard to get agreement on these things, can be prone to
bike-shedding if not careful
• Potentially big loss of control
51. Why it's Good
• More involvement
• Better bug reports, wider testing
• Other companies (including competitors) much more willing
to join in and share resources
• Learn from others, and collaborate with them to improve
• More diverse community can mean problems get spotted
and fixed sooner (+cheaper)
• More opportunities from wider use
• New features and fixes “for free”
52. But not for everyone!
• You're no longer in charge!
• Need to be willing to give up some control in order to
welcome in the wider community
• Not instant – will be an up-front cost, you need to be in for
the longer term to take advantage
• Need to train up your marketing team, and management
• Won't all be smooth sailing, there will be problems
• BDFLs aren't allowed
• “No jerks”, you need to work with the community