Offered By: IBMSkillsNetwork
Overthinking AI? Comparing Top LLMs on Reasoning
OpenAI’s o3, DeepSeek-R1, and IBM’s Granite-3.2 are redefining problem-solving with logical thinking. This guided project puts their reasoning to the test with classic riddles, revealing their strengths, weaknesses, and tendencies to overthink. Compare responses from different models: which ones solve puzzles efficiently, and which get lost in details? Observe how prompt instructions shape reasoning, from concise answers to step-by-step breakdowns. Gain insight into how these models reason and how they balance clarity against complexity.
Guided Project
Artificial Intelligence
At a Glance
How Many R’s Are Actually in Strawberry? AI Tries to Reason It Out
A Look at the Project Ahead
✅ Compare AI reasoning styles by testing multiple models on logic puzzles.
✅ Observe overthinking vs. direct solutions, identifying when excessive reasoning helps or hinders.
✅ Experiment with prompt engineering to guide AI reasoning effectively.

What You'll Need
🔹 Access to a browser to run the Generative AI Classroom lab, where you can easily compare models side by side.
🔹 Basic understanding of logic puzzles (no programming required!).
🔹 Curiosity to explore AI’s strengths and quirks in solving problems.
Key Takeaways: What You’ll Learn
🧠 Overthinking vs. Clarity – Learn when detailed reasoning is beneficial and when it leads to unnecessary complexity.
📊 AI Strengths and Weaknesses – Discover where AI excels and where it struggles with common sense and probabilistic thinking.
Estimated Effort
30 Minutes
Level
Beginner
Skills You Will Learn
Generative AI, LLM
Language
English
Course Code
GPXX01MCEN