Kuldeep Singh Sidhu’s Post

View profile for Kuldeep Singh Sidhu
Kuldeep Singh Sidhu Kuldeep Singh Sidhu is an Influencer

Senior Data Scientist @ Walmart - Search | BITS Pilani

Exciting new research from TU Wien – Academy for Continuing Education and UCL explores how Large Language Models (LLMs) perform in generating Boolean queries for systematic literature reviews. Here's why this matters: >> Key Findings: - GPT-4 and GPT-3.5 significantly outperformed previous benchmarks in precision scores for the CLEF TAR dataset - Open-source models like Mistral and Mixtral showed competitive performance against proprietary GPT models - The study revealed significant variability in query generation results, highlighting reliability challenges >> Technical Deep Dive: The researchers implemented a comprehensive pipeline that: - Automatically generates Boolean queries from review topics using various LLMs - Tests multiple model variants including GPT-3.5-1106, GPT-4-1106, Mistral-7B, and Mixtral-8X7B - Evaluates queries using PubMed database retrieval - Implements seed-based generation for reproducibility testing >> Model Architecture: The study utilized: - API-based models: GPT-3.5, GPT-4, Mistral-tiny, Mistral-small - Locally-run open source models: Mistral-7B-Instruct-v0.2 and Zephyr-7b-beta - Dense encoder setting using SentenceTransformers for similarity matching This research is crucial for medical researchers and information retrieval specialists looking to automate systematic review processes. The code is publicly available on GitHub for further exploration and validation.

  • No alternative text description for this image

To view or add a comment, sign in

Explore topics