Multimodal Interoperability with the CLAMS Platform

Kelley Lynch, Kyeongmin Rim, Owen King, James Pustejovsky

MultiMedia Modeling: 31st International Conference (MMM 2025), Nara, Japan

Demo Track · Springer · Pages 173–179

Abstract

This paper presents the CLAMS (Computational Linguistics Applications for Multimedia Services) platform, which provides a framework for developing and deploying interoperable multimedia analysis tools. CLAMS facilitates the processing of audiovisual content such as broadcast news videos by enabling seamless integration of tools across different media types including text, audio, video, and images. At the core of CLAMS is the Multi-Media Interchange Format (MMIF), a JSON-based annotation format designed to support the exchange of data between different tools in a consistent and structured manner. This ensures that annotations produced by one tool can be readily used by others, creating complex pipelines for automated content analysis. We describe the features provided by the CLAMS software development kit (SDK), present 2 example pipelines of CLAMS applications, and show a visualization tool for exploring data generated using CLAMS.

BibTeX

@inproceedings{lynch2025clams,
  author    = {Lynch, Kelley and Rim, Kyeongmin and King, Owen
               and Pustejovsky, James},
  title     = {Multimodal Interoperability with the {CLAMS} Platform},
  booktitle = {MultiMedia Modeling: 31st International Conference,
               MMM 2025, Nara, Japan},
  year      = {2025},
  pages     = {173--179},
  publisher = {Springer},
  doi       = {10.1007/978-981-96-2074-6_19}
}