CONCORD: Asynchronous Sparse Aggregation for Device-Cloud RAG

The Challenge of Document Isolation in RAG

Traditional Retrieval-Augmented Generation (RAG) systems often struggle with the trade-off between data privacy and retrieval performance. When documents are isolated on edge devices, centralized indexing becomes impossible, leading to latency and privacy concerns. CONCORD (Asynchronous Sparse Aggregation) addresses this by enabling a hybrid device-cloud architecture that maintains document isolation while allowing the cloud to perform effective retrieval.

Asynchronous Sparse Aggregation

The core innovation of CONCORD is its sparse aggregation mechanism. Instead of requiring full document synchronization or massive data transfers, the system uses an asynchronous approach to aggregate relevant information. By sparsifying the retrieval process, the system minimizes bandwidth usage and reduces the computational load on edge devices. This allows the cloud-based RAG component to query distributed, isolated datasets without compromising the security of the underlying documents.

Performance and Deployment

Designed for integration into modern RAG pipelines, CONCORD optimizes the interaction between local storage and cloud-based LLM inference. By decoupling the retrieval process from the immediate availability of the full document corpus, the system maintains high performance even under constrained network conditions. This approach is particularly relevant for applications requiring strict data sovereignty, such as enterprise search or personal AI assistants, where sensitive documents must remain on the user's device while still being accessible for context-aware generation.

The Challenge of Document Isolation in RAG

Asynchronous Sparse Aggregation

Performance and Deployment

More from AI & LLMs

Deceptive Alignment: When Models Fake Compliance

IMEX: Interaction-Based Model Explanation

GLARE: Natural Language Interfaces for Global Model Explanations

SciRisk-Bench: Evaluating Safety in AI for Science