Skip to yearly menu bar Skip to main content


Poster
in
Workshop: CODEML: Championing Open-source DEvelopment in Machine Learning

M(M)ORE : Massive Multimodal Open RAG & Extraction

Alexandre Sallinen · Stefan Krsteski · Paul Teiletche · Allard Marc-Antoine · Baptiste Lecoeur · Michael Zhang · Fabrice Nemo · David Kalajdzic · Matthias Meyer · Mary-Anne Hartley

[ ] [ Project Page ]
Fri 18 Jul 2:15 p.m. PDT — 3 p.m. PDT

Abstract:

We introduce MMORE, an open-sourcepipeline for Massive Multimodal Open Retrieval-Augmented Generation and Extraction, designedto ingest, transform, and retrieve knowledgefrom heterogeneous document formats at scale.MMORE supports more than fifteen file types,including text, tables, images, emails, audio,and video, and processes them into a unifiedformat to enable downstream applicationsfor LLMs. The architecture offers modular,distributed processing, enabling scalable paral-lelization across CPUs and GPUs. On processingbenchmarks, MMORE demonstrates a 3.8-foldspeedup over single-node baselines and 40%higher accuracy than Docling on scanned PDFs.The pipeline integrates hybrid dense-sparseretrieval and supports both interactive APIs andbatch RAG endpoints. Evaluated on PubMedQA,MMORE-augmented medical LLMs improvebiomedical QA accuracy with increasing retrievaldepth. MMORE provides a robust, extensiblefoundation for deploying task-agnostic RAGsystems on diverse, real-world multimodal data

Chat is not available.