DiaRAG：面向糖尿病领域的智能问答系统

杨涛; 欧阳纯萍; 余颖; 万亚平

doi:10.13374/j.issn2095-9389.2024.12.31.003

摘要: 为了满足糖尿病领域对智能问答系统高效性与专业性的双重需求，本文设计并实现了融合知识图谱与检索增强生成（Retrieval augmented generation, RAG）的糖尿病领域智能问答系统——DiaRAG. 该系统提出了一种自动提示生成方法（Auto prompt generation, APG），能够自动生成适用于糖尿病领域的提示模板，用于提取糖尿病知识图谱并构建检索知识库. 同时，通过提示学习对病患提出的问句进行校正，有效解决了复杂问句中的语义和语法偏误问题. 此外，本文设计了微调排序模型（Fine-tuned reranker），对糖尿病知识图谱的社区摘要进行二次过滤，以确保检索结果与病患提问意图的高度契合. DiaRAG系统通过深度融合知识图谱与大语言模型（Large language model, LLM），充分利用外部知识库，从而显著提升了糖尿病领域知识的问答能力. 实验结果表明，DiaRAG在问答准确性、社区摘要相关性等方面均显著优于现有系统，为糖尿病个性化知识服务提供了创新性解决方案.

Abstract:

To address the dual requirements of efficiency and professionalism in diabetes-related intelligent question-answering, this study presents DiaRAG, an innovative system that synergistically integrates knowledge graphs with retrieval-augmented generation (RAG) techniques. The proposed system is specifically tailored to the diabetes domain, in which both medical expertise and updated knowledge are critical. DiaRAG introduces an autoprompt generation (APG) method that automatically synthesizes diabetes-specific prompt templates. These templates are used to extract structured information from diabetes literature and clinical data, thus facilitating the construction of a comprehensive diabetes knowledge graph and a dedicated retrieval knowledge base. By applying APG, the system effectively generates candidate prompts that enhanced the extraction of relevant knowledge triples, addressing the challenges posed by ambiguous or complex medical queries and ensuring that the subsequent retrieval process is grounded in an accurate, domain-specific context.

Furthermore, DiaRAG integrates a specialized text correction module based on PL-BART (Prompt Learning and Bidirectional Auto-Regressive Transformers). This module is designed to correct semantic and syntactic errors in patient queries. By leveraging prompt-guided correction, PL-BART improves the clarity of input questions, thus enabling the retrieval module to perform more precise matching with the underlying diabetes knowledge graph.

In the retrieval phase, a fine-tuned re-ranker model is introduced to further optimize the ordering of the candidate community summaries. This re-ranker, built on a cross-encoder architecture that employs BERT, evaluates the relevance of the retrieved documents to the patient’s query. The secondary filtering provided by this module not only enhances the alignment between the query intent and the retrieved content but also mitigates the common issue of hallucinations in large language models (LLMs) by ensuring that only high-quality, domain-relevant information is passed to the generation stage.

Experimental evaluations were conducted on the DaCorp diabetes question-answering dataset, and the results showed that DiaRAG achieved superior performance compared to state-of-the-art models, such as GPT-3.5, HuatuoGPT, and other retrieval-augmented frameworks, such as NaiveRAG and SelfRAG. Key evaluation metrics, including ROUGE-1, ROUGE-2, and ROUGE-L, indicated that DiaRAG consistently outperformed baseline methods in terms of answer accuracy and community summary relevance.

Ablation studies further demonstrated that each component—the APG module, PL-BART-based text correction, and fine-tuned re-ranker —contributed significantly to the overall system performance. Notably, iterative prompt optimization via APG and a specialized re-ranking process have been shown to be critical for handling the intricate and specialized language inherent in diabetes-related queries. In a detailed case study involving patient inquiries about the suitability of a traditional Chinese medicine for diabetic conditions, DiaRAG provided a comprehensive answer that not only considered the general pharmacological properties of the medicine but also incorporated detailed clinical insights. This nuanced explanation, which directly addressed the complexities of diabetic complications and the specific indications of the medicine, resulted in expert evaluations rating DiaRAG’s response significantly higher than those provided by competing models such as GPT-3.5 and HuatuoGPT. The experts praised DiaRAG for its precise and contextually appropriate advice, which ultimately highlighted the system’s potential for delivering personalized and reliable medical guidance.

Overall, DiaRAG represents an important advancement in the design of domain-specific intelligent question-answering systems. Seamlessly integrating structured knowledge extraction, robust text correction, and refined retrieval strategies, it offers an innovative solution for personalized medical knowledge services in diabetes care.

DiaRAG：面向糖尿病领域的智能问答系统

DiaRAG: intelligent question-answering system for the diabetes domain