Gunosyデータマイニング研究会 #97でA/Bテストに関して述べている KDD2007の論文"Practical Guide to Controlled Experiments on the Web:
Listen to Your Customers
not to the HiPPO"を紹介した記事になります。著者はMicrosoftの方です。
Gunosyデータマイニング研究会 #97でA/Bテストに関して述べている KDD2007の論文"Practical Guide to Controlled Experiments on the Web:
Listen to Your Customers
not to the HiPPO"を紹介した記事になります。著者はMicrosoftの方です。
This document describes the 2017 IEEE CIG Game Data Mining Competition hosted by Sejong University in South Korea. The competition provided access to game log data from Blade & Soul to predict player churn and survival time. There were two tracks - one for churn prediction and one for survival analysis. 13 teams participated in track 1 and 5 teams in track 2. The winning team YD from Japan used ensemble methods like LSTM, DNN and extra trees for track 1 and ensemble conditional inference trees for track 2. Other top techniques included random forest and light gradient boosting machines. The competition helped advance game data mining research by providing a large real-world dataset.
A/B Testing at Pinterest: Building a Culture of Experimentation WrangleConf
The document discusses building a culture of experimentation at Pinterest and outlines a maturity model for experimentation. It describes 5 stages for experimentation maturity: get started, get big, get better, get out, and get tools. For each stage, it identifies common problems, such as underutilization or needing guidance, and provides recommendations for how to address them, like evangelizing experiments or implementing processes to help scale experimentation. The overall aim is to establish a systematic approach to experimentation that helps transition an organization from initial experimentation to widespread experimentation supported by automation and analysis tools.
1. The document discusses RESTful APIs and gRPC, comparing their characteristics and use cases.
2. RESTful APIs typically use HTTP and JSON to access resources via URLs while gRPC uses protocol buffers and HTTP/2 for efficient streaming and RPC.
3. gRPC is better suited for microservices and mobile apps due to its ability to handle streaming and performance, while REST is more widely used due to its simplicity and support in most languages.
Apache Kudu - Updatable Analytical Storage #rakutentechCloudera Japan
This document provides an overview of Apache Kudu, an open source columnar storage system that enables fast analytics on fast changing data. It discusses Kudu's architecture including its use of tablets, replication using Raft consensus, and columnar storage with compression. The document also covers Kudu's write path involving memstores, delta memstores, and flushing to disk; its read path involving lookups without merging files; and compaction processes. Overall, the summary provides a high-level technical introduction to Kudu's capabilities and design.
This document describes the 2017 IEEE CIG Game Data Mining Competition hosted by Sejong University in South Korea. The competition provided access to game log data from Blade & Soul to predict player churn and survival time. There were two tracks - one for churn prediction and one for survival analysis. 13 teams participated in track 1 and 5 teams in track 2. The winning team YD from Japan used ensemble methods like LSTM, DNN and extra trees for track 1 and ensemble conditional inference trees for track 2. Other top techniques included random forest and light gradient boosting machines. The competition helped advance game data mining research by providing a large real-world dataset.
A/B Testing at Pinterest: Building a Culture of Experimentation WrangleConf
The document discusses building a culture of experimentation at Pinterest and outlines a maturity model for experimentation. It describes 5 stages for experimentation maturity: get started, get big, get better, get out, and get tools. For each stage, it identifies common problems, such as underutilization or needing guidance, and provides recommendations for how to address them, like evangelizing experiments or implementing processes to help scale experimentation. The overall aim is to establish a systematic approach to experimentation that helps transition an organization from initial experimentation to widespread experimentation supported by automation and analysis tools.
1. The document discusses RESTful APIs and gRPC, comparing their characteristics and use cases.
2. RESTful APIs typically use HTTP and JSON to access resources via URLs while gRPC uses protocol buffers and HTTP/2 for efficient streaming and RPC.
3. gRPC is better suited for microservices and mobile apps due to its ability to handle streaming and performance, while REST is more widely used due to its simplicity and support in most languages.
Apache Kudu - Updatable Analytical Storage #rakutentechCloudera Japan
This document provides an overview of Apache Kudu, an open source columnar storage system that enables fast analytics on fast changing data. It discusses Kudu's architecture including its use of tablets, replication using Raft consensus, and columnar storage with compression. The document also covers Kudu's write path involving memstores, delta memstores, and flushing to disk; its read path involving lookups without merging files; and compaction processes. Overall, the summary provides a high-level technical introduction to Kudu's capabilities and design.
This slides were used at "5th Machine Learning 15minetes!" http://machine-learning15minutes.connpass.com/event/40294
Introduce important things to tackle machine learning in a company.