We are thrilled to announce that our ICML paper, titled "Automated Evaluation of Retrieval-Augmented Language Models with Task-Specific Exam Generation," is now available on arXiv. This paper presents one of the first automated, interpretable, task-specific evaluation methods for Retrieval-Augmented Generation (RAG) in Q&A contexts. For a summary of our contributions, check the following 🧵 from my co-author Laurent Callot on 𝕏: https://lnkd.in/gBF5iXaR. https://lnkd.in/g6r3Xv27 #ICML #LLMEvaluation #AmazonScience #RAG
Congratulations, Behrooz Omidvar-Tehrani, it's very interesting.
Amir Ali Aynetchi, this looks very interesting.
Congrats Behrooz Omidvar-Tehrani
Co-founder & Head of AI @ Root Signals | Measure and Control Your GenAI
Interesting work. The link to the implementation mentioned in your paper (https://github.com/amazon-science/auto-rag-eval) does not seem to be live. Any pointers to your repo?