Co-located with the IEEE International Conference on Multimedia and Expo 2026
5 July - 9 July 2025, Bangkok, Thailand
Embodied Intelligence is redefining AI by shifting focus from abstract reasoning to learning through physical interaction. In embodied systems, multimedia serves as the essential medium for perception, communication, and action, enabling intelligent agents to see, hear, sense, and act within dynamic environments. Because embodied AI creates closed-loop “Perception-Decision-Action” structures, the behavior of such systems emerges from continuous environmental engagement rather than static dataset processing.
Our workshop brings together researchers from signal processing, computer vision, robotics, and network communications to address critical challenges in building next-generation intelligent systems. We propose a Five-Layer Unified Architecture encompassing:
With spatial computing devices and humanoid robots becoming reality, the time is ripe for establishing Embodied Multimedia as a coherent research agenda.
We invite submissions on topics including but not limited to:
Papers must adhere to the standard ICME 2026 format (IEEE conference style, double-column, up to 6 pages including references) and be submitted via:
The template can be found via:
All Submissions will undergo a double-blind peer review process. Accepted papers will be presented by the authors and be included in the IEEE Xplore.
To be Confirmed...
Tongji University, China
Dr. Yang Liu is an Assistant Professor at Tongji University, China. Dr. Liu received his Ph.D. from Fudan University (2025) and was a visiting scholar at the University of Toronto. His research focuses on anomaly detection in embodied systems. He has published in ACM CSUR, IEEE TIP, TII, and TIE, and served as Guest Editor for IEEE TCSS, Area Chair for BMVC 2025 and IEEE ICIP 2025, and Workshop/Special Session Co-Chair for IEEE ICASSP, ICIP, and DSP.
University of British Columbia, Canada
Dr. Jing Liu is a Postdoctoral Research Fellow at The University of British Columbia, Canada. Dr. Liu received his Ph.D. from Fudan University (2023). His research interests include edge-cloud collaboration, anomaly detection, and multimodal learning. He has published multiple first-author papers in top tier venues and serves as a reviewer for IEEE TII, TCSVT, IoTJ, TITS, TMC, TNNLS, ACM CSUR, and as a TPC member for CVPR, ECCV, ICCV, NeurIPS, ICLR, IJCAI, and ACM MM.
Cardiff University, UK
Dr. Wei Zhou is an Assistant Professor (UK Lecturer) at Cardiff University. Wei’s research interests mainly focus on perceptual image processing, multimodality, and visual computing for healthcare. Dr Zhou has published over 70 papers in recent years, including publications in top-tier venues, e.g., IEEE TIP, IEEE TMM, IEEE TCSVT, IEEE TMI, CVPR, ACM MM, MICCAI, etc. Wei serves as General Chair for the 1st Cardiff Image & Vision Computing Workshop and Chair for the Elections Committee of IEEE UK & Ireland SPS Chapter. Wei is now an Associate Editor of IEEE Transactions on Neural Networks and Learning Systems (TNNLS), Pattern Recognition, Neurocomputing, Springer Signal, Image and Video Processing, and Human-centric Computing and Information Sciences. Wei has also served as the Topic Editor and Guest Editor for many journals, such as Elsevier Displays; the Area Chair for ACM MM 2024, ICME 2025, and IJCNN 2025; the Lead Special Session Chair for IEEE ICME 2025, IEEE QoMEX 2025, IEEE MMSP 2023, and the Special Session Co-Chair for IEEE ICIP 2025 and 2024.
Tongji University, China
Dr. Lulu Guo is currently a Research Professor at Tongji University, Shanghai, China. He received the B.S. degree in vehicle engineering and the Ph.D. degree in control engineering from Jilin University, Changchun, China, in 2014 and 2019, respectively. Before joining Tongji University, he was a Postdoctoral Research Associate with the University of Georgia, Athens, GA, USA. His current research interests include advanced vehicle control, energy management, and vehicle cybersecurity.
Cardiff University, UK
Dr. Minghao Zou is a postdoctoral researcher at Cardiff University. His research interests span computer vision and process mining, with a particular focus on action recognition, object detection, and human-object interaction detection. He serves as a reviewer for several journals and conferences, such as IEEE TCSVT, IEEE TNNLS, ACM MM, IEEE TCSS, ACM TOMM, and Pattern Recognition.
To be Confirmed...