De-mystifying contributing to PostgreSQLLætitia Avrot
PostgreSQL is a great community. They are open-minded, friendly, agreeable and so on. You feel like helping them.
The problem is you are shy and you look at community people as gods. On top of that you don't want to mess up with their work or bother them with obvious and silly (to them) questions!
This conference talk is based on my own true story. I will tell you about how I submitted my very first patch to the community. After some background presentation about how the community works, I will try to answer the following questions:
What can I do to help (and you'll see that even without coding you can do a lot!)?
What's a contribution?
What's a patch? How can I create one?
And I hope that sooner or later you'll come and join the community and you'll feel so proud of yourselves!
Optimizations in Spark; RDD, DataFrameKnoldus Inc.
Developing Apache Spark Jobs is the easier part of the process but the difficult portion comes in while executing them under full load as each job is unique when it comes to performance. Spark programs often face bottlenecks in terms of CPU, network bandwidth, memory usage which stems from Spark’s basic nature of in-memory computations.
In this webinar, we will deal with the problem of how optimally you can perform your job operations in Apache Spark. We will address common performance problems including -
~ Inadequate transformations when working with RDD API as optimization is the developer’s responsibility, unlike in SQL querying language.
~ Proper partitioning of data so that Spark can perform tasks optimally
~ Why DataFrames have better performance than RDD?
Here’s the agenda of the webinar -
~ Spark Execution Model
~ Optimizing Shuffle Operations
~ Optimizing Functions
~ SQL VS RDD
~ Logical & Physical Plan
~ Optimizing Joins
De-mystifying contributing to PostgreSQLLætitia Avrot
PostgreSQL is a great community. They are open-minded, friendly, agreeable and so on. You feel like helping them.
The problem is you are shy and you look at community people as gods. On top of that you don't want to mess up with their work or bother them with obvious and silly (to them) questions!
This conference talk is based on my own true story. I will tell you about how I submitted my very first patch to the community. After some background presentation about how the community works, I will try to answer the following questions:
What can I do to help (and you'll see that even without coding you can do a lot!)?
What's a contribution?
What's a patch? How can I create one?
And I hope that sooner or later you'll come and join the community and you'll feel so proud of yourselves!
Optimizations in Spark; RDD, DataFrameKnoldus Inc.
Developing Apache Spark Jobs is the easier part of the process but the difficult portion comes in while executing them under full load as each job is unique when it comes to performance. Spark programs often face bottlenecks in terms of CPU, network bandwidth, memory usage which stems from Spark’s basic nature of in-memory computations.
In this webinar, we will deal with the problem of how optimally you can perform your job operations in Apache Spark. We will address common performance problems including -
~ Inadequate transformations when working with RDD API as optimization is the developer’s responsibility, unlike in SQL querying language.
~ Proper partitioning of data so that Spark can perform tasks optimally
~ Why DataFrames have better performance than RDD?
Here’s the agenda of the webinar -
~ Spark Execution Model
~ Optimizing Shuffle Operations
~ Optimizing Functions
~ SQL VS RDD
~ Logical & Physical Plan
~ Optimizing Joins
Introduction to Redis 3.0, and it’s features and improvements. What’s difference between Redis / Memcached / Aerospike ? The strong sides of Redis, and away from the weak sides.
本議程介紹 Redis 3.0 及其歷史,探討 Redis 的特性與改進。並一併分析 Redis / Memcached / Aerospike 三者之間的差異,有助於未來面對業務場景需求提供瞭解與判斷。最後,分享 Redis 適用之場景,及其不適用場景下的備案或整合方案。議程適於 Redis 初學者、對 Redis 想深入瞭解者,及曾經莫名被 Redis 雷擊或坑殺者。
How to Become a Thought Leader in Your NicheLeslie Samuel
Are bloggers thought leaders? Here are some tips on how you can become one. Provide great value, put awesome content out there on a regular basis, and help others.
在這個資料科學蔚為風潮的年代,身為一個對新技術充滿好奇的攻城獅,自然會想要擴充自己的武器庫,學習嶄新的資料分析工具;而 R 語言,一個由統計學家專門為了資料探索與分析所開發的腳本語言,具有龐大的開源社群支持以及琳瑯滿目、數以萬計的各式套件,正是當今學習資料科學相關工具的首選。
然而,R 語言的設計邏輯與一般的程式語言不同,工程師們過去學習程式語言的經驗,往往造成學習 R 語言的障礙,本課程將從 R 語言的基礎開始,讓同學們從課堂講解以及互動式上機課程中,得以徹底理解 R 語言的核心概念與精要,學習如何利用 R 語言問資料問題,並且從資料分析的角度撰寫效率良好同時具有高度可讀性的 R 語言代碼。
Discusses about Microsystems Technologies such as thin filling processing,additive processes,subtractive processes,sacrificial processes,electrodeposition,electroforming etc
How to plan a hadoop cluster for testing and production environmentAnna Yen
Athemaster wants to share our experience to plan Hardware Spec, server initial and role deployment with new Hadoop Users. There are 2 testing environments and 3 production environments for case study.
Testing in Production, Deploy on FridaysYi-Feng Tzeng
本議題是去年 ModernWeb'19 「Progressive Deployment & NoDeploy」的延伸。雖然已提倡 Testing in Production 多年,但至今願意或敢於實踐的團隊並不多,背後原因多是與文化及態度有些關係。
此次主要分享推廣過程中遇到的苦與甜,以及自己親力操刀幾項達成 Testing in Production, Deploy on Fridays 成就的產品。