Practical AI | Optimizing for efficiency with IBM’s Granite

Optimizing for efficiency with IBM’s Granite

March 14, 2025 / 43:38 / E306

We often judge AI models by leaderboard scores, but what if efficiency matters more? Kate Soule from IBM joins us to discuss how Granite AI is rethinking AI at the edge—breaking tasks into smaller, efficient components and co-designing models with hardware. She also shares why AI should prioritize efficiency frontiers over incremental benchmark gains and how seamless model routing can optimize performance.

Featuring:

Kate Soule – LinkedIn
Chris Benson – Website, GitHub, LinkedIn, X
Daniel Whitenack – Website, GitHub, X

Links:

IBM Granite
IBM Granite on Hugging Face
IBM Expands Granite Model Family with New Multi-Modal and Reasoning AI Built for the Enterprise

Creators and Guests

Host

Chris Benson

Cohost @ Practical AI Podcast • AI / Autonomy Research Engineer @ Lockheed Martin

Guest

Kate Soule
