CLAMS Platform

Computational Linguistics Applications for Multimedia Services

Role: Core Developer & Researcher
Duration: 2017 – Present
Affiliation: Brandeis University
Website: clams.ai

Overview

CLAMS is an open-source platform for applying computational linguistics and multimedia analysis tools to audiovisual materials in cultural heritage archives. It provides an end-to-end framework for orchestrating NLP, computer vision, and ASR pipelines over video and audio content, producing rich, structured metadata in the Multi-Media Interchange Format (MMIF).

The platform is developed in collaboration with the American Archive of Public Broadcasting at GBH, enabling archivists and librarians to process large collections of public media that would be infeasible to catalog manually.

CLAMS Agent (Thesis Work)

My thesis research focuses on designing an LLM/VLM-powered agentic system that orchestrates the CLAMS platform's multimedia processing capabilities. The agent interprets user requests, selects and sequences appropriate analysis tools, and coordinates the extraction of metadata from archival video — combining language models with the platform's existing NLP, computer vision, and speech recognition applications.

SDK & Application Development

I integrate LLM interfaces into the CLAMS SDK and build applications for multimodal analysis. The SDK enables developers to create interoperable NLP and multimedia processing applications that communicate through MMIF, supporting tasks such as named entity recognition, scene detection, OCR, and automatic speech recognition.

Technical Stack

Python PyTorch LLMs / VLMs Agent Orchestration Docker Flask MMIF NLP Computer Vision ASR

Publications

2025

A Platform for AI-Assisted Archival Metadata Generation

Rim, King, Lynch, Verhagen, Pustejovsky

HCI International 2025

2025

Multimodal Interoperability with the CLAMS Platform

Lynch, Rim, King, Pustejovsky

MultiMedia Modeling (MMM) 2025 — Demo

2022

The CLAMS Platform at Work

Verhagen, Lynch, Rim, Pustejovsky

LREC 2022

2020

Interchange Formats for Visualization: LIF and MMIF

Rim, Lynch, Verhagen, Ide, Pustejovsky

LREC 2020

2019

Computational Linguistics Applications for Multimedia Services

Rim, Lynch, Pustejovsky

LaTeCH-CLfL 2019