The document discusses optimization techniques for deep learning frameworks on Intel CPUs and Fugaku aimed architectures. It introduces oneDNN, a performance library for deep learning operations on Intel CPUs. It discusses issues with C++ implementation, and how just-in-time assembly generation using Xbyak can address these issues by generating optimal code depending on parameters. It also introduces Xbyak_aarch64 for generating optimized code for Fugaku's Scalable Vector Extension instructions.
cvpaper.challenge の Meta Study Group 発表スライド
cvpaper.challenge はコンピュータビジョン分野の今を映し、トレンドを創り出す挑戦です。論文サマリ・アイディア考案・議論・実装・論文投稿に取り組み、凡ゆる知識を共有します。2019の目標「トップ会議30+本投稿」「2回以上のトップ会議網羅的サーベイ」
http://xpaperchallenge.org/cv/
The document discusses optimization techniques for deep learning frameworks on Intel CPUs and Fugaku aimed architectures. It introduces oneDNN, a performance library for deep learning operations on Intel CPUs. It discusses issues with C++ implementation, and how just-in-time assembly generation using Xbyak can address these issues by generating optimal code depending on parameters. It also introduces Xbyak_aarch64 for generating optimized code for Fugaku's Scalable Vector Extension instructions.
cvpaper.challenge の Meta Study Group 発表スライド
cvpaper.challenge はコンピュータビジョン分野の今を映し、トレンドを創り出す挑戦です。論文サマリ・アイディア考案・議論・実装・論文投稿に取り組み、凡ゆる知識を共有します。2019の目標「トップ会議30+本投稿」「2回以上のトップ会議網羅的サーベイ」
http://xpaperchallenge.org/cv/
We have published a document, "A Global Data Infrastructure for Data Sharing Between Businesses".
This document introduces the current trends toward the implementation of digital management tools that support cross border data sharing between businesses, which will be indispensable for future business transformations and pandemic responses. Today we find ourselves at the confluence of multiple evolving global trends. These include the emergence of new data driven business models, the expansion of B2B platform business, the accelerating pace of digital transformation, the growing expectations for the fulfillment of Sustainable Development Goals (SDGs) and other social needs, the rise of New Glocalism, the growth of stakeholder capitalism, and the Great Reset. In this article, we discuss the challenges of establishing a global data infrastructure for data sharing between businesses as a key ICT infrastructure for the construction of a next generation society, and the efforts that are being made to address these challenges.
NTT Laboratories
J. Arai, S. Yagi, H. Uchiyama, T. Honjo, T. Inagaki, K. Inaba, T. Ikuta, H. Takesue, K. Horikawa
This material is a poster exhibited at the ITBL community booth in SC19 (The International Conference for High Performance Computing, Networking, Storage, and Analysis 2019).
NTT Software Innovation Center
Hiroki Miura, Kota Tsuyuzaki, Junya Arai, Kohei Yamaguchi, Kengo Okitsu, Shinji Morishita
This material is a poster exhibited at the ITBL community booth in SC19 (The International Conference for High Performance Computing, Networking, Storage, and Analysis 2019).
NTT is developing a hybrid sourcing approach to address Japan's projected shortage of 430,000 IT engineers by 2025, known as the "Digital Cliff 2025". Their approach combines crowdsourcing, using platforms like Topcoder, with innersourcing by decomposing projects into microtasks that can be completed by both internal and external workers. In a case study, they developed a B2B application using this hybrid model, with crowdsourced and innersourced workers completing 49% and 39% of the code respectively. They aim to create a framework to promote this hybrid sourcing approach within NTT to help organizations overcome skills shortages and achieve digital transformations.
1) The document proposes a method for layer-level pruning of ResNet models to reduce computation costs during inference.
2) It introduces weights to Residual Units to determine their importance, allowing less important units to be erased. Units with small absolute weight values on their nonlinear maps can be erased with little impact.
3) The method repeats training and erasing layers based on unit importance. It erases layers after training and retrains, iteratively erasing more layers until accuracy drops, to prune the model while maintaining performance.
Edge computing solves issues with IoT deployment like data privacy and volume. It also allows companies to gain valuable customer and product data rather than relying on web giants. For CIOs, edge computing influences strategies around data infrastructure, organization, and IT architecture - shifting from offline to real-time analytics, human-readable to machine formats, and app-centric to data-centric designs.
BuildKit is a next-generation build system that provides efficient caching, multi-stage builds, and secure access to private assets without requiring root privileges. It can be deployed on Kubernetes using a DaemonSet or StatefulSet for caching benefits. Build definitions can be provided via Dockerfiles, Buildpacks, or CRDs like Tekton to build images on Kube nodes and push to a remote registry. Consistent hashing with StatefulSets ensures builds always hit the fastest daemon-local cache.
The document discusses utilizing spatiotemporal data from IoT devices in Redis. It proposes using a technique called "ST-coding" to encode location and timestamp data into a single code. This addresses two problems: 1) ST range queries were slow due to searching many keys; and 2) data insertion was inefficient due to load concentration on a single Redis server. By splitting the ST-code into a "PRE-code" and "SUF-code", ST range queries can be performed on a single key, avoiding use of the slow KEYS command. This improves query performance and distributes load across Redis servers.
The document discusses challenges for implementing persistent memory (PMEM) aware applications using the Persistent Memory Development Kit (PMDK). It describes how to use PMDK with Direct Access (DAX) filesystems and outlines challenges in rewriting PostgreSQL to use PMEM, including resizing checkpoint files and selecting appropriate sync functions for write ahead logging (WAL) files. Performance evaluation challenges are also discussed.