个人简介
教授,国家级青年人才,主要研究领域包括智能机器人、灵巧操作和机器学习。于2015年在美国杜克大学获得博士学位,曾任Google X资深研究科学家和Figure AI技术总监。此前研发了基于云边协同和人工智能的机器人操作系统平台,搭建了软硬一体的类人形机器人,落地了多种形态的工业级产品,申请国际专利十余项。在机器人和机器学习领域的顶级学术会议与期刊(ICRA/RSS/IROS/ICML/Neurips等)发表学术论文数十篇,包括实现了通过单一样本示教习得可泛化的复杂操作技能,操作精度达到亚毫米,提名RSS最佳论文。
研究方向
智能机器人 (Intelligent Robotics)
灵巧操作 (Dexterous Manipulation)
机器学习 (Machine Learning)
研究项目
1.基于云边协同和人工智能的机器人编程软件
在机器人操作系统层面,团队开发了全球首个面向工业机器人的基于云边协同与AI驱动的跨平台编程系统,实现了用户无需具备专业知识即可进行机器人任务编排。该系统显著提升了工业机器人在复杂装配场景(如家具、电子元器件)中的灵活性与可部署性,技术已获得8项国际专利,成果成功转化为Intrinsic公司首发产品并在Google X考核中获得最高评分,交付全球多家车企与消费电子厂商使用。
2.通过单一样本示教习得可泛化的复杂操作技能
在机器人技能学习方面,团队实现了单一样本示教即可泛化复杂操作技能的突破。该技术通过创新的视觉感知与仿真学习方法,摆脱了对大量真实数据的依赖,仅需单一示范即可实现亚毫米级精度的复杂操作。团队提出的基于视觉的物体抓取与操作方法,能够在仿真环境中学习技能并直接迁移至真实机器人,解决了传统方法在非结构化环境中的适应性问题。该成果已在ICRA、RSS等机器人领域顶级会议发表,并在Google X实现技术落地,为农业场景中的精细操作提供了核心技术支撑。
3.创新型灵巧操作技能学习框架
在系统架构层面,团队构建了高性能的端到端灵巧操作框架。该框架集成了意图识别、多模态感知、实时反馈控制与故障恢复机制,突破了传统方法在复杂操作任务中的性能瓶颈。其中多模态感知模块融合视觉、触觉等多种传感信息,实时反馈控制模块确保操作的精确执行,故障恢复机制则提升了系统的鲁棒性。该框架获得IROS评审委员会的高度评价,被认为是对实际部署场景具有深刻洞察的系统性工作。
发表论文
-
2026 ICRA
SAGE: Scene Graph-Aware Guidance and Execution for Long-Horizon Manipulation Tasks
-
2026 AAAI
FoAM: Foresight-Augmented Multi-Task Imitation Policy for Robotic Manipulation
-
2025 IROS
Splatter Joint: 3D Gaussian Splatting for Articulated Objects
-
2024 IROS
Stick Roller: Precise In-Hand Stick Rolling with a Sample-Efficient Tactile Model
-
2023 ICRA
Allowing Safe Contact in Robotic Goal-Reaching: Planning and Tracking in Operational and Null Spaces
-
2023 ICRA
Cable Routing and Assembly using Tactile-driven Motion Primitives
-
2023 ICRA
Zero-Shot Policy Transfer with Disentangled Task Representation of Meta-Reinforcement Learning
-
2023 IFAC
Prim-LAfD: A Framework to Learn and Adapt Primitive-Based Skills from Demonstrations for Insertion Tasks
-
2022 RSS (Best Paper Award Finalist)
You Only Demonstrate Once: Category-Level Manipulation from Single Visual Demonstration
-
2022 ICRA
CaTGrasp: Learning Category-Level Task-Relevant Grasping in Clutter from Simulation
-
2022 ICRA
Symbolic State Estimation with Predicates for Contact-Rich Manipulation Tasks
-
2021 IROS
Benchmarking Off-The-Shelf Solutions to Robotic Assembly Tasks
-
2021 RA-L
Robotic Cable Routing with Spatial Representation
-
2021 RSS
Robust Multi-Modal Policies for Industrial Assembly via Reinforcement Learning and Demonstrations: A Large-Scale Study
-
2021 ICRA
Learning Dense Rewards for Contact-Rich Manipulation Tasks
-
2021 ICRA
Interpreting Contact Interactions to Overcome Failure in Robot Assembly Tasks
-
2017 KDD
Convex factorization machine for toxicogenomics prediction
-
2015 ICML (Appendix)
A Multitask Point Process Predictive Model
-
2015 AAAI (Appendix)
Integrating Features and Similarities: Flexible Models for Heterogeneous Multiview Data
-
2015 COSI
Task-driven Adaptive Sensing on Quadrupole Mass Filter Systems for Classification
-
2015 Signal Process.
Multivariate Time-Series Analysis and Diffusion Maps
-
2014 NIPS (Appendix)
Analysis of Brain States from Multi-Region LFP Time-Series
-
2014 Submitted
A Bayesian Framework for Multi-Modality Analysis of Mental Health
-
2014 NIPS Workshop
Bayesian Multitask Distance Metric Learning
-
2014 NIPS Workshop
Bayesian Multiview Factor Modeling for Integrating and Analyzing Heterogeneous Clinical Data
-
2014 ICML (Appendix)
Modeling Correlated Arrival Events with Latent Semi-Markov Processes
-
2013 TBME
Sorting Electrophysiological Data via Dictionary Learning & Mixture Modeling