
We had the privilege of hosting Peter Belcak – an AI Researcher working on the reliability and efficiency of agentic systems at NVIDIA – who walked us through his new paper making the rounds in AI circles titled “Small Language Models are the Future of Agentic AI.” The paper posits that small language models (SLMs) are sufficiently powerful, inherently more suitable, and necessarily more economical for many invocations in agentic systems, and are therefore the future of agentic AI. The authors’ argumentation is grounded in the current level of capabilities exhibited by SLMs, the common architectures of agentic systems, and the economy of LM deployment. The authors further argue that in situations where general-purpose conversational abilities are essential, heterogeneous agentic systems (i.e., agents invoking multiple different models) are the natural choice. They discuss the potential barriers for the adoption of SLMs in agentic systems and outline a general LLM-to-SLM agent conversion algorithm. Learn more about AI observability and evaluation, join the Arize AI Slack community or get the latest on LinkedIn and X.
Podzilla Summary coming soon
Sign up to get notified when the full AI-powered summary is ready.
Free forever for up to 3 podcasts. No credit card required.

CUGA Agent: From Benchmarks to Business Impact of IBM's Generalist Agent

TUMIX: Multi-Agent Test-Time Scaling with Tool-Use Mixture

Meta AI Researcher Explains ARE and Gaia2: Scaling Up Agent Environments and Evaluations

Georgia Tech's Santosh Vempala Explains Why Language Models Hallucinate, His Research With OpenAI
Free AI-powered recaps of Deep Papers and your other favorite podcasts, delivered to your inbox.
Free forever for up to 3 podcasts. No credit card required.