Behrooz Omidvar-Tehrani’s Post

View profile for Behrooz Omidvar-Tehrani

Senior ML Scientist @ AWS | LLM Agents for Amazon Q

We are thrilled to announce that our ICML paper, titled "Automated Evaluation of Retrieval-Augmented Language Models with Task-Specific Exam Generation," is now available on arXiv. This paper presents one of the first automated, interpretable, task-specific evaluation methods for Retrieval-Augmented Generation (RAG) in Q&A contexts. For a summary of our contributions, check the following 🧵 from my co-author Laurent Callot on 𝕏: https://lnkd.in/gBF5iXaR. https://lnkd.in/g6r3Xv27 #ICML #LLMEvaluation #AmazonScience #RAG

Automated Evaluation of Retrieval-Augmented Language Models with Task-Specific Exam Generation

Automated Evaluation of Retrieval-Augmented Language Models with Task-Specific Exam Generation

arxiv.org

Oguzhan (Ouz) Gencoglu

Co-founder & Head of AI @ Root Signals | Measure and Control Your GenAI

9mo

Interesting work. The link to your implementation that was mentioned in your paper does not seem to be alive: https://github.com/amazon-science/auto-rag-eval Any pointers to your repo?

Niraj Jetly

Software Engineering Leader at Amazon Web Services (AWS), Ex-CTO/VP Engineering, Board Member

9mo

Congratulations Behrooz Omidvar-Tehrani , it’s very interesting .

Niccolo' Gentile, PhD

Research Scientist @ Foyer Group | AI Applied Research | Ex-Amazon

9mo

Amir Ali Aynetchi this looks very interesting

Sarthak Jain

Signal Processing/NLP @Sony @UofSC AIISC | AI/ML @IIITD

9mo
See more comments

To view or add a comment, sign in

Explore topics