This lecture covers advanced multimedia processing and learning. Multisensory signals, video, audio, and language are core components of multimedia. Multimodal learning over them is one of the core multimedia technologies in real-world applications such as intelligent surveillance, smart TVs, and human-machine interface systems. Lecture topics include the basics of multimedia learning (image, video, audio, and language representation learning), multimedia fusion schemes, multimedia alignment, and multimedia attention. In addition to the basics of multimodal learning, students participate in term projects and read recently published papers on multimodal processing and learning.
Copyright ⓒ 2015 KAIST Electrical Engineering. All rights reserved. Made by PRESSCAT