Offered By: IBMSkillsNetwork

AI meeting companion: From voice to insight

Create an app that captures audio (such as lectures) and summarizes it. You will transcribe the audio with OpenAI Whisper (speech to text) and summarize the transcript with the open-source Llama 2 LLM hosted on IBM watsonx. You will deploy the app in a serverless environment using IBM Cloud Code Engine.

Guided Project

Artificial Intelligence

164 Enrolled
4.7
(17 Reviews)

At a Glance

Imagine you're a student who records your teacher's lecture. This AI app turns the recording into accurate text, then summarizes the lecture and highlights the main points.

In this project, we'll use OpenAI's Whisper to transform speech into text. Next, we'll use an LLM hosted on IBM watsonx to summarize the transcript and find key points, with the prompt built through prompt engineering using LangChain's PromptTemplate. Finally, we'll build the user interface with Hugging Face's Gradio.



The AI app snapshot 
The output from the LLM not only summarizes and highlights key points but also corrects minor mistakes made by the speech-to-text model, which helps keep the result coherent and accurate.
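The prompt is what asks the LLM to both summarize and clean up the transcript. Here is a minimal sketch of that idea, assuming a template of our own wording (the course's actual prompt is not shown on this page). It uses plain Python string formatting as a stand-in for LangChain's PromptTemplate, which plays this role in the project; the names SUMMARY_TEMPLATE and build_prompt are hypothetical.

```python
# Hypothetical prompt wording; the project's real template may differ.
SUMMARY_TEMPLATE = (
    "List the key points with details from the transcript below, "
    "correcting obvious speech-to-text mistakes as you go.\n\n"
    "Transcript:\n{transcript}\n\n"
    "Key points:"
)


def build_prompt(transcript: str, template: str = SUMMARY_TEMPLATE) -> str:
    """Fill the template's {transcript} placeholder with the Whisper output."""
    return template.format(transcript=transcript.strip())
```

The resulting string would then be sent to the hosted LLM. Keeping the template separate from the code makes it easy to adjust the instructions without touching the rest of the app.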

A Look at the Project Ahead

Here are the key objectives for your project:
  1. Speech-to-Text Conversion: Use OpenAI's Whisper to convert lecture recordings into accurate text.
  2. Content Summarization: Use an LLM hosted on IBM watsonx to summarize the transcribed lectures and extract key points.
  3. User Interface Development: Create an intuitive, user-friendly interface with Hugging Face's Gradio so that students and educators can use the app easily.
  4. App Deployment: Deploy the application online using IBM Cloud Code Engine, making the tool accessible to a wider audience.
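The first three objectives can be sketched as one small pipeline. This is a rough outline under stated assumptions, not the course's actual code: the function names, the "base" model size, the user-facing messages, and the port are all our own choices, and the summarization step is left as a callable you supply because the watsonx client setup is not described on this page. Whisper and Gradio are imported lazily, so the core logic can be tried without either package installed.

```python
from typing import Callable, Optional


def whisper_transcribe(audio_path: str) -> str:
    """Transcribe an audio file with the open-source Whisper package."""
    import whisper  # lazy import; needs `pip install openai-whisper`

    model = whisper.load_model("base")  # model size is an assumption
    return model.transcribe(audio_path)["text"]


def process_audio(
    audio_path: Optional[str],
    transcribe: Callable[[str], str],
    summarize: Callable[[str], str],
) -> str:
    """Run speech-to-text, then summarization, on one recording."""
    if not audio_path:
        return "Please upload an audio file."
    transcript = transcribe(audio_path).strip()
    if not transcript:
        return "No speech was detected in the recording."
    return summarize(transcript)


def launch_ui(
    transcribe: Callable[[str], str],
    summarize: Callable[[str], str],
) -> None:
    """Expose the pipeline through a simple Gradio interface."""
    import gradio as gr  # lazy import; needs `pip install gradio`

    demo = gr.Interface(
        fn=lambda path: process_audio(path, transcribe, summarize),
        inputs=gr.Audio(type="filepath"),  # passes the file path to fn
        outputs=gr.Textbox(label="Summary and key points"),
        title="AI meeting companion",
    )
    # Listening on all interfaces lets a container platform route traffic in.
    demo.launch(server_name="0.0.0.0", server_port=7860)
```

To start the app you would call launch_ui(whisper_transcribe, my_summarizer), where my_summarizer is a hypothetical function that sends the prompt to the LLM on watsonx and returns its text. Passing the two steps in as arguments keeps the pipeline easy to test with stand-in functions.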

What You'll Need

General knowledge of Python and a web browser.

Certificate

No Certificate Offered

Estimated Effort

45 Min

Level

Intermediate

Industries

Information Technology

Skills You Will Learn

Artificial Intelligence, Python

Language

English

Course Code

GPXX04C6EN
