Agents of Intelligence
Exploring AI with the power of AI
Agents of Intelligence is a cutting-edge podcast dedicated to covering a wide range of topics about artificial intelligence. Our process blends human insight with AI-driven research: each episode starts with a curated list of topics, followed by AI agents scouring the web for the best public content. AI-powered hosts then craft an engaging, well-researched discussion, which is reviewed by a subject matter expert before being shared with the world. The result? A seamless fusion of AI efficiency and human expertise, bringing you the most insightful conversations on AI’s latest developments, challenges, and future impact.
Episodes
Thursday Feb 06, 2025
Fine-Tuning vs. Retrieval: What’s the Best Way to Teach AI?
When it comes to less popular knowledge, how should we train AI? Should we fine-tune it or let it retrieve information on the fly? In this episode, we break down a groundbreaking study that compares these two approaches—Fine-Tuning (FT) vs. Retrieval-Augmented Generation (RAG)—to see which one better equips AI models for niche factual knowledge. We also explore a novel approach called Stimulus RAG, which boosts retrieval accuracy without expensive fine-tuning. Tune in to find out which method wins and what it means for AI customization!
Thursday Feb 06, 2025
GraphRAG: Revolutionizing AI Data Retrieval with Knowledge Graphs
Data retrieval is getting a major upgrade! In this episode, we dive into Structured-GraphRAG, a new framework that enhances AI-powered retrieval by integrating knowledge graphs (KGs) with large language models. Using a case study on soccer data, this approach drastically reduces hallucinations, improves accuracy, and speeds up response times by over 98%. Join us as we explore how Structured-GraphRAG is setting a new standard for AI-driven information retrieval.
Thursday Feb 06, 2025
FRAMES: The Next-Level Test for AI’s Fact-Checking and Reasoning Skills
How well do AI models really think? In this episode, we explore FRAMES, a groundbreaking evaluation benchmark designed to push Retrieval-Augmented Generation (RAG) systems to their limits. Unlike traditional benchmarks, FRAMES assesses factual retrieval, reasoning, and synthesis together, exposing key weaknesses in today’s most advanced AI models. Tune in to discover why even state-of-the-art systems struggle with multi-hop reasoning—and what it means for the future of AI reliability.
Thursday Feb 06, 2025
Phi-1: Smarter AI, Smaller Model—The Power of Textbook Training
Bigger isn’t always better. In this episode, we break down Microsoft Research’s latest AI breakthrough: Phi-1, a 1.3 billion parameter coding model that outperforms much larger models by focusing on high-quality, textbook-style data. Discover how this approach challenges traditional scaling laws, slashes computational costs, and paves the way for more efficient AI development. Tune in as we explore the future of coding AI and why “textbooks are all you need.”
Thursday Feb 06, 2025
Beyond Automation Hype: The Economics of AI Adoption
AI isn't taking over jobs as fast as we think—it's all about the economics. In this episode, we dive into MIT’s latest research on computer vision automation and unpack why cost, scale, and deployment matter more than just technical feasibility. From small businesses to AI-as-a-service models, we explore what actually makes automation worth the investment and what that means for the future of work.
Tuesday Jan 28, 2025
Beyond Models: The Rise of Generative AI Agents
Generative AI is evolving beyond standalone models into fully functional agents capable of reasoning, planning, and interacting with the world through tools. In this episode, we explore the architecture of AI agents, how they differ from traditional models, and the role of tools like LangChain and Vertex AI in their development. Discover the future of AI autonomy and what it means for industries and everyday applications.
Monday Jan 27, 2025
Hallucination Mitigation: The Future of Multi-Agent AI Systems
Discover how multi-agent AI frameworks are redefining the fight against hallucinations in large language models. This episode explores how layered agents, guided by the OVON framework, reduce speculative content through iterative refinement and structured data exchange. Learn about novel KPIs, empirical results, and the potential of agentic AI to enhance trust and transparency in generative AI systems.
Monday Jan 27, 2025
MedAgentBench: Redefining AI as Medical Agents
Explore how MedAgentBench benchmarks large language models (LLMs) as medical agents, moving beyond chatbots to tackle real-world clinical tasks. This episode unpacks the dataset's 100 clinically derived tasks, its FHIR-compliant interactive environment, and insights into the current state of LLM performance. Learn how AI can reduce administrative burdens and improve healthcare delivery.
Monday Jan 27, 2025
Explore how cutting-edge AI transforms remote robotic surgery. Using the Informer model, researchers tackle network-induced issues like jitter and packet loss, ensuring real-time precision for Patient Side Manipulators. Discover the role of predictive AI in overcoming latency challenges, enhancing accuracy, and reshaping surgical possibilities with the Tactile Internet.
Monday Jan 27, 2025
Lost in Translation: The Multilingual Challenges of LLMs in Healthcare
Explore the risks of inconsistent multilingual health advice from large language models (LLMs). This episode uncovers disparities in LLM-generated responses across languages, highlighting challenges in cross-lingual consistency, cultural biases, and their implications for equitable healthcare. Learn about groundbreaking evaluation frameworks, key findings, and what this means for the future of AI in healthcare.