Offered By: IBMSkillsNetwork
Evaluating an NBA Player's Team Impact via Regression
Learn how to apply Random Forest and Kernel Ridge Regression by analyzing an NBA player's real game performance. In this guided project, you’ll engineer features, tune model hyperparameters, and analyze feature importance to understand what truly drives a player’s performance on the court. You'll leverage pandas for data manipulation, scikit-learn for model training and evaluation, and matplotlib for visualization. By the end, you’ll be able to confidently build, configure, and interpret industry standard regression models for a wide range of analytical use cases.
Continue readingGuided Project
Machine Learning
At a Glance
Learn how to apply Random Forest and Kernel Ridge Regression by analyzing an NBA player's real game performance. In this guided project, you’ll engineer features, tune model hyperparameters, and analyze feature importance to understand what truly drives a player’s performance on the court. You'll leverage pandas for data manipulation, scikit-learn for model training and evaluation, and matplotlib for visualization. By the end, you’ll be able to confidently build, configure, and interpret industry standard regression models for a wide range of analytical use cases.
A Look at the Project Ahead
- Ingest and Explore Data: Load game logs, inspect distributions, and spot trends in his performance.
- Engineer Predictive Features: Create new metrics (e.g. rolling averages, usage rates) that capture hidden aspects of a player's game.
- Train and Tune Models: Build Random Forest and Kernel Ridge Regression pipelines in scikit‑learn and optimize hyperparameters.
- Analyze Results: Use feature‑importance scores to interpret which factors drive a player's performance.
- Visualize Results: Plot your findings with matplotlib.
Learning Objectives
- Feature Engineering & Pipelines:
- Design and integrate custom processes and pipelines to prepare real sports data for regression.
- Model Training, Tuning & Interpretation:
- Configure, evaluate, and interpret both tree‑based and kernel‑based regression models to extract actionable insights.
- Learn how each model trains and how hyperparameters affect the training process.
What You'll Need
- Familiarity with Python programming
- Understanding of fundamental machine learning concepts (e.g. regression, overfitting, cross‑validation)
- Basic familiarity with pandas DataFrames and its methods
- A modern browser (Chrome, Edge, Firefox, Safari)
Estimated Effort
30 Minutes
Level
Beginner
Skills You Will Learn
Feature Engineering, Machine Learning, Pandas, Random forest, Regression, Scikit-learn
Language
English
Course Code
GPXX0H19EN