具身智能系列讲座（三）| AIRS in the AIR 预告

Original AIRS 深圳市人工智能与机器人研究院

2024-09-15

6月20日 15:00-16:00

第 65 期

具身智能系列讲座（三）

具身智能是一种全新的人工智能理念，它区别于传统人工智能的观念，主张智能的产生不仅依赖算法和算力，还需要通过与实际世界的互动来实现。具身智能研究跨越了机器人学、人工智能、认知科学及神经科学等多个学科，旨在深化对智能本质的理解。

AIRS in the AIR推出具身智能系列讲座，旨在汇集相关领域顶尖学者专家，探讨具身智能带来的新机遇，解决其面临的技术与应用挑战，并将智能机器人应用推向家庭、工业、医疗和探索等多个领域，促进人机交互的自然化和任务执行的效率化。

系列讲座第三期邀请爱丁堡大学信息学院副教授、自主智能体研究组负责人Stefano V. Albrecht，他将分享团队在深度强化学习与大语言模型领域的研究成果，包括深度强化学习在自动驾驶与多机器人仓储场景的应用和大语言模型在家用机器人中的应用等，并且提出他对于大语言模型智能体研究的观察与思考。

执行主席

林天麟

AIRS智能机器人中心主任

香港中文大学（深圳）理工学院助理教授

报告嘉宾

Stefano V. Albrecht

爱丁堡大学信息学院副教授

Dr. Stefano V. Albrecht is Associate Professor in Artificial Intelligence in the School of Informatics, University of Edinburgh. He leads the Autonomous Agents Research Group which specialises in developing machine learning algorithms for autonomous systems control and decision making, with a particular focus on reinforcement learning and multi-agent interaction. In his roles as Royal Academy of Engineering and Royal Society Industrial Fellow, he actively develops industry applications in the areas of multi-robot warehouses with Dematic/KION, and autonomous driving with Five AI which completed one of the most extensive urban road trials of autonomous driving in London before being acquired by Bosch in 2022. Dr. Albrecht is affiliated with the Alan Turing Institute where he leads the Multi-Agent Systems theme. In 2022, he was nominated for the IJCAI Computers and Thought Award based on his research which introduced Stochastic Bayesian Games and optimal solution algorithms, which have since been applied in a range of domains. Previously, Dr. Albrecht was a postdoctoral fellow at the University of Texas at Austin working with Prof. Peter Stone. He obtained PhD and MSc degrees in Artificial Intelligence from the University of Edinburgh, and a BSc degree in Computer Science from Technical University of Darmstadt. He is co-author of the new MIT Press textbook "Multi-Agent Reinforcement Learning: Foundations and Modern Approaches" which is freely available at .

报告介绍

报告主题：From Deep Reinforcement Learning to LLM-based Agents: Perspectives on Current Research

报告嘉宾：Stefano V. Albrecht

Since the recent successes of large language models (LLMs), we are beginning to see a shift of attention from deep reinforcement learning to LLM-based agents. While deep RL policies are typically learned from scratch to maximise some defined return objective, LLM-agents use an existing LLM at their core and focus on clever prompt engineering and downstream specialisation of the LLM via supervised and reinforcement learning techniques. In this talk, I will first provide a broad overview of my group’s research in deep RL, which focuses among other topics on developing sample-efficient and robust RL algorithms for both single- and multi-agent control tasks, including industry applications in autonomous driving and multi-robot warehouses. I will then present our recent research into LLM-agents, where we propose an approach for household robotics that takes into account user preferences to achieve more robust and effective planning. I will conclude with some personal observations about the state of LLM-agent research: (a) many papers in this field follow essentially the same recipe by focussing on prompt engineering and downstream specialisation; (b) this recipe makes their scientific claims brittle as they depend crucially on the specific LMM engine, and (c) LLMs are not natively designed to maximise objectives for optimal control and decision making. Based on these observations, I believe some fruitful research avenues can be identified.

活动时间

2024年06月20日 15:00 - 16:00

活动地点

香港中文大学（深圳）诚道楼207

参与方式

扫码报名，报名成功将通过短信告知

AIRS in the AIR 为 AIRS 重磅推出的系列活动，与您一起探索人工智能与机器人领域的前沿技术、产业应用、发展趋势。2022年举办至今，已邀请百余位国内外嘉宾，吸引了超40万人次参与。

AIRS in the AIR 往期视频回顾请前往哔哩哔哩观看：https://www.bilibili.com/video/BV1yr4y1z7zr/?spm_id_from=333.788

（如有感兴趣的主题或合作意向，欢迎在 AIRS 公众号后台留言。）

林天麟教授团队在IEEE T-RO发表论文，提出一种驱动集成球形齿轮机构

机器人能为未来的养老做什么？

具身智能简史、现状与未来展望