stark
STaRK provides a comprehensive benchmark for assessing the retrieval effectiveness of large language models on knowledge bases. It includes practical applications in areas like product search, academic inquiry, and biomedicine, offering realistic query challenges to spur advancements in retrieval technology. With easy installation via pip, resources on Hugging Face, and a dedicated leaderboard, STaRK supports researchers in refining context-specific retrieval strategies.