![]() |
| Track Chair: Shahzad Ashraf, Hohai University, China |
The explosive growth of multimedia data—encompassing images, speech, signals, and videos—across diverse industries has spurred an urgent demand for intelligent, efficient, and robust processing technologies. Specifically, integrating advanced algorithms such as deep learning, multi-modal fusion, and real-time computing into multimedia processing pipelines to elevate accuracy, speed, and adaptability has emerged as a pivotal trend. This special session aims to address the unique challenges of image, speech, signal, and video processing by focusing on cutting-edge technologies including intelligent perception, feature extraction, noise reduction, and cross-modal understanding. It provides a premier forum for academic researchers and industry practitioners to exchange insights, share breakthroughs, and develop innovative solutions for enhancing the analysis, understanding, and application of multimedia data in real-world scenarios. Through this collaborative endeavor, the session seeks to drive technological innovation in multimedia processing and empower sectors such as security, healthcare, communication, and entertainment to embrace a new era of intelligent multimedia applications.
Signal, image and video processing
Image and video analysis and understanding
Audio and acoustic processing and analysis
Segmentation, features and descriptors
Texture and color analysis
Automatic speech and speaker recognition
Multimedia analysis, indexing and retrieval
Vision sensors
Medical image and signal analysis
Computer-aided detection and diagnosis
Image guidance and robot guidance of interventions
Face recognition
Image compression, coding, and encryption
Graph theory in image processing
Natural language processing