Shivansh Srivastava’s Post

View profile for Shivansh Srivastava

AI Engineer @ techolution | Google Certified ML Engineer | Red Hat Certified (3X)

🚀 DeepSeek-R1: A Paradigm Shift in LLM Reasoning! The AI landscape just witnessed a major breakthrough! DeepSeek-R1, a revolutionary Large Language Model (LLM), has proven that pure Reinforcement Learning (RL) can significantly enhance reasoning capabilities without a single byte of supervised fine-tuning data. Github Repo: https://lnkd.in/g9SyDsdy I decided to run it locally on my system and test its reasoning with a fun query running: “How many ‘r’ are there in the word strawberry?” 🍓 DeepSeek-R1 responded with a fascinating chain of thought, refer to the image below. This demonstrates the model’s ability to reason step-by-step, showcasing the power of RL-driven training in handling language tasks with ease! 🔥 Why Is DeepSeek-R1 Special? 💡 Zero Supervised Data, 100% RL Forget traditional supervised fine-tuning—DeepSeek-R1-Zero evolved entirely through RL, improving itself over multiple iterations. 📊 Crushing Benchmarks Across Domains: •🧮 Mathematics: Achieved a stunning 97.3% on MATH-500, surpassing many top-tier models. •💻 Coding: Scored an impressive 96.3 percentile on Codeforces, demonstrating expert-level coding skills. •🧠 General Reasoning: Excelled across diverse logic and reasoning benchmarks. ❤️ Open-Source Power DeepSeek-AI has generously open-sourced versions of the model, ranging from 1.5B to 70B parameters, giving the AI community access to cutting-edge reasoning capabilities. This is a game-changer in AI research. Smaller models achieving remarkable feats through knowledge distillation show that size isn’t everything! Exciting times ahead 🚀 #AI #MachineLearning #DeepLearning #LLM #GenAI #AIAgents #aiagents #ReinforcementLearning #DeepSeekR1 #OpenSource

  • graphical user interface, text, application
Nitish Sharma

🇮🇳 | AI/ML Fanatic | Ex- AI Developer @DIRO | Ex- AI/ML Developer Inter @MonsterAPI | Ex-AI Intern @MetaGeeksTechnologies | Ex-Project Intern @STMicroElectronics |

1mo

The AI world is evolving at an alarming pace

To view or add a comment, sign in

Explore topics