Advaith Sridhar’s Post

View profile for Advaith Sridhar

AI at CMU | Best Outgoing Student, IIT-Madras

Frontier models are saturating reasoning benchmarks such as ARC-AGI. But is that enough to call these models intelligent? 🧠 We need more robust ways to evaluate intelligence. I believe linguistics may be a powerful tool here - it requires exceptional reasoning skills, and current frontier models perform poorly on even easy problems in the domain. In my latest blog post, I explore why linguistics offers a unique and meaningful way to test intelligence. Check it out here: https://lnkd.in/gMVQJiuh I’d also love to hear thoughts on this - what other domains do you think could push the boundaries for evaluating AI?

Linguistics - A test to measure AI intelligence

Linguistics - A test to measure AI intelligence

advaithsridhar.blog

Chirag Mehta

Co-founder & CPO @ Coheso (legaltech) | AI @ CMU | x-HP Inc

2mo

Very insightful and cited sources help build conviction. High quality stuff. I’m surprised that O1 outperformed the competition drastically. Would you like me to run some prompts on O1-pro for further testing?

To view or add a comment, sign in

Explore topics