arxiv:2511.07577

A Decentralized Retrieval Augmented Generation System with Source Reliabilities Secured on Blockchain

Published on Nov 10 · Submitted by Yining Lu on Nov 18

Abstract

AI-generated summary: A decentralized retrieval-augmented generation system uses blockchain-based reliability scoring to manage data sources, enhancing performance and cost-efficiency compared to centralized systems.

Existing retrieval-augmented generation (RAG) systems typically use a centralized architecture, which incurs high costs for data collection, integration, and management and raises privacy concerns. There is a great need for a decentralized RAG system that enables foundation models to use information directly from data owners, who maintain full control over their sources. However, decentralization brings a challenge: the numerous independent data sources vary significantly in reliability, which can diminish retrieval accuracy and response quality. To address this, our decentralized RAG system has a novel reliability scoring mechanism that dynamically evaluates each source based on the quality of the responses it helps generate and prioritizes high-quality sources during retrieval. To ensure transparency and trust, the scoring process is securely managed through blockchain-based smart contracts, creating verifiable and tamper-proof reliability records without relying on a central authority. We evaluate our decentralized system with two Llama models (3B and 8B) in two simulated environments where six data sources have different levels of reliability. Our system achieves a 10.7% performance improvement over its centralized counterpart in real-world-like unreliable data environments. Notably, it approaches the upper-bound performance of centralized systems under ideally reliable data environments. The decentralized infrastructure enables secure and trustworthy scoring management, achieving approximately 56% marginal cost savings through batched update operations. Our code and system are open-sourced at github.com/yining610/Reliable-dRAG.
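To make the scoring idea in the abstract concrete, here is a minimal Python sketch of reliability-weighted retrieval with batched score updates. It is not the authors' implementation: the `ReliabilityLedger` class, the exponential-moving-average update, the `alpha` and `default_score` values, and the similarity-times-reliability ranking rule are all assumptions chosen for illustration, and a plain in-memory dictionary stands in for the smart-contract storage described in the paper.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class ReliabilityLedger:
    """Hypothetical in-memory stand-in for on-chain reliability records."""
    alpha: float = 0.2          # assumed smoothing factor for the moving average
    default_score: float = 0.5  # assumed prior for sources with no history
    scores: dict = field(default_factory=dict)
    pending: list = field(default_factory=list)

    def score(self, source_id):
        """Return the current reliability score of a source."""
        return self.scores.get(source_id, self.default_score)

    def record_feedback(self, source_id, quality):
        """Queue a response-quality observation in [0, 1]; nothing is written yet."""
        self.pending.append((source_id, quality))

    def flush(self):
        """Apply all queued observations in one batch.

        Batching means one write per source instead of one per observation.
        Returns the number of sources whose score was updated.
        """
        grouped = defaultdict(list)
        for source_id, quality in self.pending:
            grouped[source_id].append(quality)
        for source_id, qualities in grouped.items():
            mean_quality = sum(qualities) / len(qualities)
            old = self.score(source_id)
            self.scores[source_id] = (1 - self.alpha) * old + self.alpha * mean_quality
        self.pending.clear()
        return len(grouped)


def rerank(candidates, ledger, top_k=2):
    """Rank (source_id, passage, similarity) tuples by similarity * reliability."""
    ranked = sorted(
        candidates,
        key=lambda c: c[2] * ledger.score(c[0]),
        reverse=True,
    )
    return ranked[:top_k]


ledger = ReliabilityLedger()
ledger.record_feedback("source_a", 1.0)
ledger.record_feedback("source_a", 1.0)
ledger.record_feedback("source_b", 0.0)
updated = ledger.flush()  # three observations, two score writes

candidates = [
    ("source_b", "passage from a less reliable source", 0.9),
    ("source_a", "passage from a more reliable source", 0.7),
]
top = rerank(candidates, ledger, top_k=1)
```

In this toy run the passage from `source_a` is ranked first despite its lower similarity, because its reliability score has risen while that of `source_b` has fallen. The `flush` step only illustrates why batching reduces the number of writes; it does not model gas costs or reproduce the 56% figure reported above.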

Community

Comment from the paper author and submitter:

💻 WHAT WE BUILT: a decentralized RAG (dRAG) system that addresses data reliability challenges in real-world settings. The sources provided by each data owner are securely managed and scored on a blockchain.

🚀 HOW TO DEPLOY: we provide a one-line command for easy deployment of our dRAG system at https://github.com/yining610/Reliable-dRAG

