Retrieval-augmented generation
거대언어모델 답변의 품질을 높이기 위한 기법 중 하나. 사용자가 입력한 원래의 프롬프트에 해당 프롬프트와 관련된 정보를 추가로 덧붙여서 “증강”한 뒤 이를 기반으로 답변을 생성하게 하기.
Articles
- 2024-11-05 - HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems
- 2024-09-20 - Introducing Contextual Retrieval (Anthropic)
- “A method that dramatically improves the retrieval step in RAG”
- “This method can reduce the number of failed retrievals by 49% and, when combined with reranking, by 67%.”
- “The company’s revenue grew by 3% over the previous quarter.” → “This chunk is from an SEC filing on ACME corp’s performance in Q2 2023; the previous quarter’s revenue was $314 million. The company’s revenue grew by 3% over the previous quarter.”
- 청크에 맥락을 붙이는 방법? Claude Haiku + Prompt Caching.
- 2024-08-19 - The RAG Playbook - jxnl.co
Tools
- github.com/bhavnicksm/chonkie
- “The no-nonsense RAG chunking library that’s lightweight, lightning-fast, and ready to CHONK your texts”