Missed out on TechXchange 2025? No worries! Our workshops are now available to everyone 🤩 Learn more

Offered By: IBMSkillsNetwork

Multi-Agent Restaurant Chatbot with Vision LLM, CrewAI & RAG

Build your own multi-agent restaurant recommendation chatbot with Multimodal LLM, CrewAI, Chroma DB and RAG. Learn to orchestrate agents through YAML-defined configurations and Pydantic schemas to ensure structured, reliable outputs. Use IBM watsonx slate embeddings and a Chroma vector database to power restaurant retrieval. Integrate a Vision LLM to analyze dining photos and a Serper web tool to capture real-time food trends. Explore how CrewAI enables collaborative agent workflows that combine RAG retrieval and online trend analysis to create an adaptive, smart dining assistant.

Continue reading

Guided Project

Artificial Intelligence

At a Glance

Build your own multi-agent restaurant recommendation chatbot with Multimodal LLM, CrewAI, Chroma DB and RAG. Learn to orchestrate agents through YAML-defined configurations and Pydantic schemas to ensure structured, reliable outputs. Use IBM watsonx slate embeddings and a Chroma vector database to power restaurant retrieval. Integrate a Vision LLM to analyze dining photos and a Serper web tool to capture real-time food trends. Explore how CrewAI enables collaborative agent workflows that combine RAG retrieval and online trend analysis to create an adaptive, smart dining assistant.

Have you ever struggled to decide where to eat? With countless options and ever-changing food trends, choosing a restaurant can be overwhelming. In this project, you’ll build an AI-powered restaurant recommender system that goes far beyond simple filtering by cuisine or price. By orchestrating multiple CrewAI agents and integrating Retrieval-Augmented Generation (RAG) with a WatsonX Slate embedding model and Chroma vector database, you’ll create an intelligent dining assistant that learns from user history, interprets dining photos, and adapts to real-time food trends. This system doesn’t just suggest popular spots—it builds a nuanced profile of the user’s tastes and recommends restaurants that fit their preferences, budget, and style of dining.

What You’ll Learn

By the end of this project, you will be able to:
  • Build a RAG-powered retriever for restaurants: Encode restaurant data with WatsonX Slate embeddings and store them in Chroma for efficient similarity search.
  • Create and coordinate multi-agent systems with CrewAI: Define agents like the Profile Builder, Coarse RAG Recommender, Food Trend Analyst, and Recommendation Finalizer to work collaboratively.
  • Analyze multimodal user data: Use vision-capable LLMs to extract insights from dining photos and integrate them with textual reviews.
  • Incorporate real-time web search: Use Serper to fetch trending cuisines and new restaurants, ensuring recommendations stay current.
  • Deliver structured outputs with Pydantic: Standardize agent outputs for reliability and seamless downstream processing.
  • Run and orchestrate an end-to-end pipeline: From user history to final ranked recommendations, you’ll deploy a fully functional AI restaurant recommender.
  • Build a Gradio Web App for online chatbot!

Who Should Enroll

  • Early-career data scientists and ML engineers who want hands-on experience building multi-agent systems.
  • AI enthusiasts curious about CrewAI, RAG, and multimodal LLM applications.
  • Students and researchers interested in applying cutting-edge AI to real-world recommendation challenges.
  • Developers in food-tech or hospitality who want to explore AI-driven personalization.

Why Enroll

This project bridges recommender systems, multimodal AI, and agent orchestration to show how intelligent assistants can transform the way people discover restaurants. Instead of just listing top-rated spots, your system will understand the user’s preferences, stay updated with trends, and deliver personalized, trustworthy suggestions. By the end, you’ll have a working multi-agent CrewAI restaurant recommender, practical experience with RAG pipelines and multimodal LLMs, and insights into how AI can power the next generation of personalized dining experiences.

What You’ll Need

To get the most out of this project, you should have:
  • Basic Python programming knowledge.
  • Familiarity with LLM concepts and vector databases (helpful but not required).
  • (Optional) Sign up for a free Serper API account as a part of the project for real-time online search.
  • Interest in AI agents, recommender systems, and multimodal data.
All dependencies are pre-configured in the environment, and the project runs best on current versions of Chrome, Edge, Firefox, or Safari.

Estimated Effort

1 Hr + 45 Mins

Level

Intermediate

Skills You Will Learn

AI Agents, CrewAI, Multi-Agent Systems, Multimodal AI, RAG, Recommender System

Language

English

Course Code

GPXX0RK7EN

Tell Your Friends!

Saved this page to your clipboard!

Have questions or need support? Chat with me 😊