Mitsuhiko Nakamoto
Hi, I'm Mitsuhiko! I am a third-year CS Ph.D. student in the Berkeley Artificial Intelligence Research (BAIR) Lab at UC
Berkeley, advised by Professor Sergey
Levine.
My research goal is to develop algorithms that empower robots with both high-level dexterity and
generalization capabilities, and to incorporate them into everyday life.
To this end, my research interests focus on data-driven approaches to solving real-world robotic
tasks, including topics such as offline reinforcement learning (RL), online RL fine-tuning, and
imitation learning.
Recently, I have been particularly interested in how to use RL to quickly
improve pre-trained generalist robot policies in test environments.
Prior to Berkeley, I received my bachelor's degree from University of Tokyo in March 2022, advised
by Professor Yoshimasa
Tsuruoka.
After graduating, I also had a great opportunity to spend the summer in Professor Yutaka Matsuo's Lab before moving to
Berkeley.
Email  / 
Google Scholar
 / 
GitHub  / 
Twitter  / 
Talks
|
|
Research
* denotes equal contribution, † denotes core contributor
|
Conference Papers and Pre-prints
|
Steering Your Diffusion Policy with Latent Space Reinforcement Learning
Andrew Wagenmaker†, Mitsuhiko Nakamoto†, Yunchu
Zhang†, Seohong Park, Waleed Yagoub,
Anusha Nagabandi, Abhishek Gupta†, Sergey Levine†.
Pre-print
arxiv link /
project page
We propose DSRL (Diffusion Steering via Reinforcement Learning), a sample-efficient and lightweight
approach for improving pre-trained diffusion- or flow-based policies by steering the initial noise
input through RL.
We successfully demonstrate real-world RL-based fine-tuning of Pi-0, a pre-trained generalist policy
from Physical Intelligence.
|
Journal Papers
Deep Learning Algorithm to Detect Cardiac Sarcoidosis From Echocardiographic Movies
S.Katsushika, S.Kodera, M.Nakamoto, K.Ninomiya, N.Kakuda, H.Shinohara, R.Matsuoka,
H.Ieki, M.Uehara, Y.Higashikuni, K.Nakanishi, T.Nakao, N.Takeda, K.Fujiu, M.Daimon, J.Ando,
H.Akazawa,
H.Morita, I.Komuro.
In Circulation Journal, 2021
paper
link
|
Deep learning model to detect significant aortic regurgitation using
electrocardiography
S.Sawano, S.Kodera, S.Katsushika, M.Nakamoto, K.Ninomiya, H.Shinohara,
Y.Higashikuni,
K.Nakanishi, T.Nakao, T.Seki, N.Takeda, K.Fujiu, M.Daimon, H.Akazawa, H.Morita, I.Komuro.
In Journal of Cardiology, 2021
paper
link
|
Automatic detection of vessel structure by deep learning using intravascular
ultrasound
images of the coronary arteries
H.Shinohara, S.Kodera, K.Ninomiya, M.Nakamoto, KS.Katsushika, A.Saito, S.Minatsuki,
H.Kikuchi, A.Kiyosue, Y.Higashikuni, N.Takeda, K.Fujiu, J.Ando, H.Akazawa, H.Morita, I.Komuro.
In PLOS ONE, 2021
paper
link
|
Hadware Applications
|
Teleoperated Robot
- This is a project done in this
course
- Designed the gripper and master interface for a teleoperated robot.
- Technologies: CoppeliaSim, Arduino, Servo Motor, 3D CAD design, 3D Printer, potentiometers
- [Click
here to see demo videos]
|
|
Remote Switch Toggler
- A device for turning on/off the electric switch remotely using a smartphone.
- Technologies: ESP32, Servo Motor, BLE socket programming, electronic design (EAGLE)
- [Click
here to see demo videos]
|
Software Applications
|
Flappy Chameleon (a smartphone game app)
- [App Store Link]
- 2017.10-2018.2
- Planned, designed, implemented by myself.
- Technologies: C#, Unity
|
|
Foolip
|
Services
Reviews: NeurIPS (2024, 2025), ICLR (2024, 2025), ICML (2025), ICRA (2025), CoRL(2025), RA-L
|
I am from Yokohama, Japan. My name in Japanese is 中本光彦.
During my free time, I enjoy playing and watching sports, especially soccer. I also enjoy playing
Japanese chess (aka Shogi).
|
|