Automating ML with PyCaret: Train & Compare Multiple Models to Find the Best Performer

PyData Global 2025 Recap

A live demonstration of how PyCaret simplifies machine learning workflows by enabling users to train and compare multiple models with minimal code.
PyData
Machine Learning
PyCaret
Model Comparison
Automation
Author

Oren Bochman

Published

Thursday, December 11, 2025

Keywords

PyData, Machine Learning, PyCaret, Model Comparison, Automation

pydata global

pydata global
TipLecture Overview

This Live demonstration shows how PyCaret, an open-source low-code machine learning library, can dramatically simplify model training and comparison workflows. PyCaret is democratizing machine learning by empowering anyone to train multiple algorithms and compare their performance with minimal code. Attendees will witness live demonstrations of training various ML algorithms and using automated comparison techniques to select the best performer based on key metrics. Perfect for data scientists, developers, and ML enthusiasts looking to spend less time coding and more time on model analysis and selection.

Machine learning workflows often involve repetitive tasks, complex code, and time-consuming model comparisons. PyCaret changes this paradigm by democratizing machine learning - empowering anyone to train multiple algorithms and systematically compare their performance with low-code solutions. With PyCaret’s philosophy of “spend less time coding and more time on analysis,” this library transforms the model selection process by automating training and comparison across multiple algorithms.

TipWhat You’ll Learn:
  • Practical knowledge of automated model training and comparison
  • Experience with systematic algorithm evaluation using PyCaret
  • Understanding of performance metrics for model selection
  • Ready-to-use code examples for multi-algorithm comparison
  • Confidence to choose the best ML algorithm for your specific projects
TipPrerequisites:
  • Basic understanding of Python
  • Familiarity with machine learning concepts (helpful but not required)
  • No prior PyCaret experience needed

Join us for this fast-paced, demo-heavy session that will transform how you approach machine learning projects!

TipSpeakers:

Manjunath Janardhan

I am a Principal AI Engineer with over two decades of experience transforming complex business challenges through innovative AI solutions. My career is defined by delivering measurable impact, including a patented Intelligent Service Platform that achieved an 80% reduction in operational costs. Currently at MSG Global Solutions, I lead AI development initiatives for SAP Enterprise applications, with a primary focus on SAP Profitability and Performance Management (PaPM). My work involves architecting and implementing enterprise-scale Generative AI solutions for the PaPM Universal Model, where I integrate vector databases with SAP HANA to significantly enhance information retrieval capabilities.

My previous role at GE Healthcare demonstrated my ability to scale AI solutions globally, where I built on-premises Generative AI systems that boosted developer productivity by 40% across international teams. I specialize in combining open-source Large Language Models with Hybrid-RAG and Agentic techniques, leveraging cloud-native architectures across AWS, Azure, and GCP platforms. My portfolio includes high-impact tools such as MICT GPT, CODE GPT, and Service GPT, with Aspire CODE GPT notably reducing development time for the Aspire CT Product by 30%.

My technical foundation encompasses the complete software development lifecycle, from modernizing monolithic systems to microservices using Java and C++, to containerizing applications with Docker and Kubernetes. I maintain active contributions to open-source NLP projects, reflecting my commitment to advancing the broader AI community.

Professional development remains central to my practice. I regularly engage with the AI community through conferences, workshops, webinars, and hackathons, recently developing a working prototype for a Socratic DSA Tutor. As an industry speaker, Medium blogger, and content creator, I share practical insights on AI implementation strategies and emerging technologies, focusing on mentoring the next generation of AI engineers while driving innovation in enterprise AI applications.

demo + slide repo

Outline

  • ML and PyCaret Fundamentals (13 mins)
    • What is Machine Learning, Machine Learning Algorithms and workflows
    • What is PyCaret
  • Live Demo: Multi-Algorithm Training & Comparison (10 mins)
    • Hands-on demonstration using the Diabetes Dataset
    • Training multiple algorithms simultaneously with minimal code
    • Automated model comparison using various performance metrics
    • Real-time exploration of model performance visualizations
    • Selecting the best performer based on key evaluation metrics
  • Wrap-up & Resources (2 mins)
    • Key takeaways and next steps
    • Access to GitHub repository with slides and demo notebooks
  • Q&A (5 min)

Title

Title

About me

About me

Agenda

Agenda

Introduction to Machine Learning

Introduction to Machine Learning

Machine Learning vs Traditional Coding

Machine Learning vs Traditional Coding

Machine Learning vs Traditional Coding

Machine Learning vs Traditional Coding

Why PyCaret?

Why PyCaret?
  • Low-code, fast MLexperimentation in Python
  • Clean API for training, comparing,and tuning models
  • Automation of preprocessing,imputation, and feature selection
  • Multiple model benchmarking with one command

Live Demo

Live Demo

Live Demo – Training Models

Live Demo – Training Models

PyCaret vs MLOps Tools

PyCaret vs MLOps Tools

KeyTakeaways

KeyTakeaways
  • PyCaret streamlines the entire MLworkflow
  • Enables efficient model development and testing
  • Supports easy model comparison,persistence, and deployment
  • Great for rapid prototyping and education!

Resources

Resources

Thanks

Thanks

Reflections

Pycaret looks like a better solution for rapid prototyping of machine learning models compared to building everything from scratch. This seems to be another option of moving towards productionizing machine learning models starting with a notebook.

Citation

BibTeX citation:
@online{bochman2025,
  author = {Bochman, Oren},
  title = {Automating {ML} with {PyCaret:} {Train} \& {Compare}
    {Multiple} {Models} to {Find} the {Best} {Performer}},
  date = {2025-12-11},
  url = {https://orenbochman.github.io/posts/2025/2025-12-11-pydata-pycaret/},
  langid = {en}
}
For attribution, please cite this work as:
Bochman, Oren. 2025. “Automating ML with PyCaret: Train & Compare Multiple Models to Find the Best Performer.” December 11, 2025. https://orenbochman.github.io/posts/2025/2025-12-11-pydata-pycaret/.