ACM MM Asia 2021 Workshop on

Multi-Modal Embedding and Understanding

Gold Coast, Australia
December 1-3, 2021


Humans perceive the physical world in multiple ways, e.g., by watching, touching, and hearing, which means that we process multi-modal information to perceive our environment. Multi-modal understanding plays a crucial role in giving machines this ability. Because of its significance, multi-modal embedding and understanding has attracted considerable research attention and made substantial progress in the past couple of years. Recent advances in deep learning, such as self-supervised learning and pre-training, inspire broader and deeper exploration of multi-modal embedding and understanding. In this workshop, we aim to bring together researchers from the multimedia field to discuss recent research on, and future directions for, multi-modal embedding and understanding and their applications.

Call for Papers

Multi-modal embedding and understanding are important and fundamental problems in the field of multimodal analysis and have attracted much research attention in recent years. Previous works have explored shallow embedding and understanding in many downstream tasks, including cross-modal retrieval, visual navigation, visual question answering (VQA), and visual captioning. To encourage researchers to explore new and advanced techniques in this area, we are organizing a workshop on “multi-modal embedding and understanding” in conjunction with ACM MM Asia 2021 and are calling for contributions. Topics include, but are not limited to, the following:

  1. Large-scale pre-training for multi-modal embedding and understanding
  2. Self-supervised learning in multi-modal embedding and understanding
  3. Semi-supervised learning in multi-modal embedding and understanding
  4. Contrastive learning in multi-modal embedding and understanding
  5. Interpretability in multi-modal embedding and understanding
  6. Interactive multi-modal understanding
  7. Trustworthy AI for multi-modal understanding
  8. Cross-modal matching and retrieval
  9. Cross-modal understanding
  10. Multi-modal deep fake generation and detection
  11. And other related…

Submission Guidelines

Submission Site:
After signing in to CMT3 as an author, click “create new submission…” and choose "Workshop2:……" to submit your paper to the workshop.

Paper Format:

Submitted papers (.pdf format) must use the ACM Article Template. Please remember to add Concepts and Keywords.

Please use the template in traditional double-column format to prepare your submissions. For example, Word users may use the Word Interim Template, and LaTeX users may use the sample-sigconf template.


As stated in the CfP, submitted papers may be 6 to 8 pages. Up to two additional pages may be added for references. The reference pages must contain only references. Overlength papers will be rejected without review. Optionally, you may upload supplementary material that complements your submission (100 MB limit).


Paper submissions must conform to the "double-blind" review policy. This means that authors should not know the names of the reviewers of their papers, and reviewers should not know the names of the authors. Please prepare your paper in a way that preserves the anonymity of the authors.

Papers without appropriate blinding will be rejected without review.


Papers submitted to ACM Multimedia Asia workshops must be the original work of the authors. They may not be simultaneously under review elsewhere. Publications that have been peer-reviewed and have appeared at other conferences or workshops may not be submitted to the workshop. Authors should be aware that ACM has a strict policy with regard to plagiarism and self-plagiarism. The authors' prior work must be cited appropriately.

Author List:

Please ensure that you submit your papers with the full and final list of authors in the correct order. The author list registered for each submission may not be changed in any way after the paper submission deadline. (Note that this rule concerns the identity of the authors; typos, for example, can still be corrected.)


Please proofread your submission carefully. It is essential that the language used in the paper is clear and correct so that it is easily understandable. (Either US English or UK English spelling conventions are acceptable.)

Important Dates