Introduction to Matsuo Laboratory (ENG).pptx

©︎MATSUO LAB, THE UNIVERSITY OF TOKYO
Introduction to Matsuo Lab
March 2024
Photography, video
recording and
disclosure to third
parties without
permissions are strictly
prohibited.

©︎MATSUO LAB, THE UNIVERSITY OF TOKYO 2
Matsuo Lab startups
And more
Matsuo Lab, Graduate School of Engineering, The University of Tokyo
• Matsuo Lab belongs to the Graduate School of Engineering at The University of Tokyo and
specializes in Artificial Intelligence (AI) and Web Engineering research
• Comprised of 200+ members including staff and students
• + startups in various application area (including robotics, NLP, vision, …)
Researchers
(about 10
people)
Staff
(About 50
people)
Assigned
students
(About 50
people)
TAs
(About 20
people)
Prof. Matsuo
Masters and doctoral courses: Technology Management for
Innovation, Graduate School of Engineering
Department: Systems Innovation Program, Faculty of
Engineering
＋
Areas of specialization
 Deep Learning (2011 ~ )
R&D on cutting-edge technology
(Deep Learning)
Application to image recognition
and robotics
 Web engineering (2002 ~ )
Analysis of social media, big data,
etc.
Design and operation of web
services

Representative Officer
Prof. Yutaka Matsuo
1997 Graduated from the Faculty of Engineering at The University of Tokyo
with a Bachelor’s degree in Information and Communication Engineering
2002 Completed a doctoral program and earned a doctorate in engineering at
the Graduate School of Engineering at The University of Tokyo
Became a researcher with the National Institute of Advanced Industrial
Science and Technology (AIST)
From Oct. 2005. Visiting Scholar, Stanford University
From Oct. 2007 Associate Professor, Institute of Engineering Innovation / Center for
Structuring of Knowledge / Department of Technology Management for
Innovation, Graduate School of Engineering, The University of Tokyo
From 2014. Joint Representative and Project Associate Professor, Chair for Global
Consumer Intelligence, Department of Technology Management for
Innovation, Graduate School of Engineering, The University of Tokyo
2012–14 Editor-in-Chief, Transactions of the Japanese Society for Artificial
Intelligence, then Chair of Ethics Committee (present post)
June 2017 Founder and Director, Japan Deep Learning Association (JDLA)
From April 2019 Professor of Research Into Artifacts, Center for Engineering (RACE) /
Department of Technology Management for Innovation, Graduate
School of Engineering, The University of Tokyo
From June 2019 Concurrent Outside Director, SoftBank Group Corp.
From Oct. 2021 Member, Council of New Form of Capitalism Realization
Japan Deep Learning Association

Building an Ecosystem
• Matsuo lab is aiming to build an ecosystem where the results of research are not kept within academia,
but widely shared in the form of startups and services, so that the benefits of those economic activities
recirculate and promote further research, in The University of Tokyo and the Hongo area.
Implementation
Fundamental research
Education
Incubation
(eventually becoming
big companies)
Expertise, resources, etc.
for success to be returned
to the academia.
Return the know-how and
resources for success to the
university
Nurturing of
technological
“seeds” and let
them in society
Provide advanced education base on
fundamental research
Provide practical learning
opportunities through classroom
lectures, OJT and participation in
joint researches
Create a spiral of innovation

Overview
3.
Implementation
1.
Fundamental
Research
2.
Education
4.
Incubation
R&D focused on Deep Learning and its application,
aiming to make machines smarter and explain the
principles of intelligence
Promotion of DX in industry through joint
researches on Deep Learning in collaboration
with private companies
Development and provision of
human resource development
programs for students and
adults (not limited to students
and faculty members of The
University of Tokyo)
Nurture and support
launch of startups from
universities and
laboratories by
providing
entrepreneurship
education

Fundamental Research | Team Mission
■Mission
Create intelligence and discover the
principle of human intelligence
1. How the human brain works is one of the big
mysteries
2. Making machine smarter have a huge social impact
■Research Fields
1. Algorithm of deep learning (RL, Generative Models,
Transfer Learning)
2. Application of deep learning (NLP, Vision, Robotics)

Fundamental Research | World Models
• The key to future Deep Learning development is technology to simulate the real world, and the machine equivalent of
human imagination.
• The world’s leading AI companies are already conducting ongoing research on this topic.
Sources: https://deepmind.com/blog/article/neural-scene-representation-and-rendering, https://worldmodels.github.io
Technology Overview Examples
Humans can use imagination to compensate for gaps in information
and to posit future conditions.
E.g. Imagining future from the
current state
Similarly, the key to the future development of AI is to efficiently learn
“common sense” from experiences, and to be able to “imagine” the
future. The core technology used is World Models.
CRASH!!
E.g. Looking at a part of an
object and imagining the whole
picture
Leading companies and laboratories like Google focus on World Models are
conducting research in this topic
Example from DeepMind (Google): Reconstructing an entire
object or group of objects from a series of limited views
The object of this game is to avoid the bullets. The
system’s ability to avoid being hit is improved by
incorporating an efficient mechanism to imagine
the future
“Neural scene representation and rendering”, S. A. Eslami, et al., Science,
360(6394):1204–1210, 2018.
Based on images from
three different perspectives,
AI reconstructs the 3D view.
Example from Google Brain: Efficient prediction of future events
“Recurrent world models facilitate policy evolution”. D. Ha, J. Schmidhuber,
NeurIPS 2018, pp. 2455–2467, 2018.

Our fundamental research on the world model
• Object-centric world models:
• A framework for recognizing and predicting representations for each object in an image
or video without explicit supervision.
• We propose a model that separates the representations of objects that are related to
interaction (dynamic representations, such as positions) and those that are not related
(global representations, such as colors).
• We successfully separated the representations so that we can change only the color of the
object without changing its position.
[Greff+ 20]
Dynamic
representation Global
representation

Workshops on the world model
• Organized session on the world model at JSAI2023 (the largest
conference on AI in Japan)
• Workshop on world models at IROS2023 (top-tier robotics
conference)

Changes in automated driving technology using the world model
Current Pipeline Processing
1. Recognizes the presence of a bicycle in front of the car
2. Predicts the path of a bicycle
3. Recognizes that there are an obstacle in the path of the car
4. Judges that the bicycle will be in the path of the car due to
the presence of a telephone pole
5. Generate path to avoid the bicycle
6. Determines the amount of control to run along the path
1. Input current sensing data into the world model
2. Output predictions from the world model
3. Generate a path that does not conflict with the predicted bicycle
path
4. Determines the amount of control to run along that path
5. Output control amount from the world model as well
Example: A telephone pole and a
bicycle running are in front of
your car
Given as a rule

Fundamental Research | Application (Robotics)
• By applying deep learning, we aim to develop intelligent robotics systems
• Combining recent progress in deep learning (including LLMs), our tidy up robot system won 1st prize at the RoboCup23
(Japan) and 3rd prize in RobotCup23 (world competition)。

Fundamental Research | Research on prompt engineering
We are also researching on large language models. Our research member, Takeshi Kojima,
found prompt, “Let’s think step by step”, which elicits the logical knowledges and improve
logical reasoning.
Standard Prompting Proposed Prompting (Zero-Shot CoT)
• LLM are typically give poor performance on
multi-step reasoning (e.g. math)
• Internal working of the LLMs is also unclear
• Simply add a magical phrase (known as
prompt), “Let’s think step by step” elicit logical
knowledge
• Improve reasoning performance
e.g., MultiArith (17.7% -> 78.7%)
”Large Language Models are Zero-Shot Reasoners”, NeurIPS2022, (900+ citations at 2023/11/21)

LLM Model"Weblab-10B" from Matsuo Lab. （2023/8/18）
• Developed a Large Language Model (LLM) with 10 billion parameters for Japanese and English by pre-
training and post-training (fine tuning), and released the model to the public.
• The model was released to the public by pre-training and post-training (fine tuning). The model was designed to
increase the amount of training data by using not only Japanese but also English datasets for training, and to
improve the accuracy of Japanese by transferring knowledge between languages.
• This is the highest level of publicly available model in Japan.

Fundamental Research | Accepted publications
• Top-tier International Conferences ：ICLR, NeurIPS, ICML, AAAI,
IJCAI etc…
10 top-tier papers accepted
2020 2021 2022 2023
3
8
10
4月〜3月までの採録数
Number of researchers
doubled from 10
3
7
20
(20 FTE)
11
10
(8.1FTE)
We are also building new technologies that are completely different from traditional deep learning toward an
innovative theory that connects the brain and AI.
2020 2021 2022 2023

Fundamental Research | Accepted conferences
15
Deep Generative and World Models
• “A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation”, ICLR2023
• “End-to-End Training of DBMs by Unbiased Contrastive Divergence with Local Mode Initialization”, ICML2023
• “DreamSparse: Escaping from Plato’s Cave with 2D Frozen Diffusion Model given Sparse Views”, NeurIPS2023
Reinforcement Learning and Robotics
• “Control Graph as Unified IO for Morphology-Task Generalization”, ICLR2023 (Spotlight)
• ICLR2021, ICML2021, NeurIPS2021, ICLR2022 (Spotlight) etc.
Transfer learning
• “Collective Intelligence for 2D push Manipulations with Mobile Robots”, RA-L, 2023
• “Test-Time Classifier Adjustment Module for Model-Agnostic Domain Generalization”, NeurIPS2021 (Spotlight)
• IJCAI2022 etc.
Natural language processing (NLP)
• “Large-Language Models are Zero-Shot Reasoners”, NuerIPS2022
• EMNLP2021×2, ACL2021, NAACL2022×2, EMNLP2022×2，ICLR2023
Theory
• “Regularization and Variance-Wighted Regression Achieves Minmax Optimality in Linear MDPs: Theory and Practice”, ICML2023
• "Group Equivariant Conditional Neural Processes“, ICLR2021

Overview
3.
Implementation
1.
Fundamental
Research
2.
Education
4.
Incubation
Nurture and support
universities and
laboratories by providing
entrepreneurship
education
R&D focused on Deep Learning and its application,
aiming to make machines smarter and explain the
principles of intelligence

Education | Overview of Lectures
Web Engineering and
Business Models
Web工学
Fundamental Projects
Web Engineering
Chair for Global Consumer
Intelligence
(Cultivation of data
scientists)
Data Science Business
Models
Web工学
Deep Learning
Basic courses
Deep Learning
Web工学
Entrepreneurship
• In AY 2022-23, Matsuo lab provides 15 lectures covering the following topics:
• “Web engineering” provides lectures on a fundamental technology for user experience design
• “Data Science” focus on data-science technology and its applications in business and management
• “Deep Learning” covering topics from the basis of Deep Learning to cutting-edge knowledge
• “Entrepreneurship” offers ideas and knowledge on starting a technology-based company
Intensive courses
Spring seminar
(Image recognition)
Summer school
（Financial trading and machine learning,
Deep Generative Model,
Natural Language Processing)
Entrepreneurship
Education Design
Donation Course
Invitation to Deep Tech
Entrepreneurs
Visionary startups
Chair for AI Business
Transformation
Chair for World Models Seminar on data-driven
entrepreneurship
Seminar on data-driven
business proposition

Education | Results (Attendance)
• From April 2014 to February 2021, a total of over 7,500 students and working adults
attended the courses.
• In AY 2023-24 alone, attendance is expected to surpass 11,000, raising cumulative
attendance to over 200,000 in the coming year.
0
500
1000
1500
2000
2500
3000
3500
4000
4500
5000
2014 2015 2016 2017 2018 2019 2020 2021
To approx.
11,000
2022 2023
5,700

Overview
3.
Implementation
1.
Fundamental
Research
2.
Education
4.
Incubation
R&D focused on Deep Learning (World Models)
aiming to achieve smarter machines and explain
the principles of knowledge
Nurture and support
universities and
laboratories by
providing
entrepreneurship
education

Social Implementation | Results and Case Studies
• Matsuo Lab is pursuing the application of research in a wide variety of topics, industries or types of
data, such as self-driving cars, object detection, image processing, behavior analysis or forecasting.
Theme Industry Project Overview
Image analysis
Healthcare
Development of image diagnosis algorithms to assist in the diagnosis
diagnosis of major dementias such as Alzheimer’s disease, by
detecting micro-hemorrhages in MRI scans
Behavior
analysis
Manufacturing
(Parts)
Detection and visualization of people’s movement in the factory to
factory to analyze the cause of defective product rates and devise the
devise the transfer of skilled workers’ expertise
Forecasting
Manufacturing
(Chemicals)
Early detection of abnormalities in chemical plants and identification of
identification of their causes
(Photos are for illustrative purposes
only.)

Overview
3.
Implementation
1.
Fundamental
Research
2.
Education
4.
Incubation
R&D focused on Deep Learning (World Models)
aiming to achieve smarter machines and explain the
principles of knowledge
Nurture and support
universities and
laboratories by
providing
entrepreneurship
education

Incubation | Kigyo (起業) Quest
• Kigyo Quest is a program that abstracts the success model of startups launched at Matsuo Lab and provide the
knowledge to increase the probability of success
• Launched in the summer of 2021
• By April 2022, there are 3 companies registered and currently preparing to launch
1st Stage Getting Your Weapons (Education)
 Attend certified Kigyo Quest classes to learn the basics of your technology (data science/deep learning).
2nd Stage Getting Stronger Through Practice (Application)
 Join a company as a certified AI engineering intern and gain experience in development, project management
and pitching proposals to clients as on-the-job training.
 Developing the skills and business strengths needed for launching your own business.
3rd Stage Forming Your Party, Starting Your Journey
 Find partners and set foot into the real business world.
 Gain experience in making proposals to clients and acquire the
knowledge and skills necessary to start up a company
New entrants
Senior engineer
Entrepreneur in the making
START

Incubation | 23 Startups Launched by Matsuo Lab
23
listed
• Since 2012, 23 start-ups (including 2 listed companies) have been launched by the graduates of Matsu Lab

Introduction to Matsuo Laboratory (ENG).pptx

Recommended

Recommended

More Related Content

Similar to Introduction to Matsuo Laboratory (ENG).pptx

Similar to Introduction to Matsuo Laboratory (ENG).pptx (20)

Recently uploaded

Recently uploaded (20)

Introduction to Matsuo Laboratory (ENG).pptx

Editor's Notes