Powered by RND

The AI Fundamentalists

Dr. Andrew Clark & Sid Mangalik
The AI Fundamentalists
Nieuwste aflevering

Beschikbare afleveringen

5 van 39
  • Metaphysics and modern AI: What is thinking? - Series Intro
    This episode is the intro to a special project by The AI Fundamentalists’ hosts and friends. We hope you're ready for a metaphysics mini‑series to explore what thinking and reasoning really mean and how those definitions should shape AI research. Join us for thought-provoking discussions as we tackle basic questions: What is metaphysics and its relevance to AI? What constitutes reality? What defines thinking? How do we understand time? And perhaps most importantly, should AI systems attempt to "think," or are we approaching the entire concept incorrectly? Show notes:• Why metaphysics matters for AI foundations• Definitions of thinking from peers and what they imply• Mixture‑of‑experts, ranking, and the illusion of reasoning• Turing test limits versus deliberation and causality• Towers of Hanoi, agentic workflows, and brittle stepwise reasoning• Math, context, and multi‑component system failures• Proposed plan for the series and areas to explore• Invitation for resources, critiques, and future guestsWe hope you enjoy this philosophical journey to examine the intersection of ancient philosophical questions and cutting-edge technology.What did you think? Let us know.Do you have a question or a discussion topic for the AI Fundamentalists? Connect with them to comment on your favorite topics: LinkedIn - Episode summaries, shares of cited articles, and more. YouTube - Was it something that we said? Good. Share your favorite quotes. Visit our page - see past episodes and submit your feedback! It continues to inspire future episodes.
    --------  
    16:19
  • AI in practice: Guardrails and security for LLMs
    In this episode, we talk about practical guardrails for LLMs with data scientist Nicholas Brathwaite. We focus on how to stop PII leaks, retrieve data, and evaluate safety with real limits. We weigh managed solutions like AWS Bedrock against open-source approaches and discuss when to skip LLMs altogether.• Why guardrails matter for PII, secrets, and access control• Where to place controls across prompt, training, and output• Prompt injection, jailbreaks, and adversarial handling• RAG design with vector DB separation and permissions• Evaluation methods, risk scoring, and cost trade-offs• AWS Bedrock guardrails vs open-source customization• Domain-adapted safety models and policy matching• When deterministic systems beat LLM complexityThis episode is part of our "AI in Practice” series, where we invite guests to talk about the reality of their work in AI. From hands-on development to scientific research, be sure to check out other episodes under this heading in our listings.Related research:Building trustworthy AI: Guardrail technologies and strategies (N. Brathwaite)Nic's GitHubWhat did you think? Let us know.Do you have a question or a discussion topic for the AI Fundamentalists? Connect with them to comment on your favorite topics: LinkedIn - Episode summaries, shares of cited articles, and more. YouTube - Was it something that we said? Good. Share your favorite quotes. Visit our page - see past episodes and submit your feedback! It continues to inspire future episodes.
    --------  
    35:11
  • AI in practice: LLMs, psychology research, and mental health
    We’re excited to have Adi Ganesan, a PhD researcher at Stony Brook University, the University of Pennsylvania, and Vanderbilt, on the show. We’ll talk about how large language models LLMs) are being tested and used in psychology, citing examples from mental health research. Fun fact: Adi was Sid's research partner during his Ph.D. program.Discussion highlightsLanguage models struggle with certain aspects of therapy including being over-eager to solve problems rather than building understandingCurrent models are poor at detecting psychomotor symptoms from text alone but are oversensitive to suicidality markersCognitive reframing assistance represents a promising application where LLMs can help identify thought trapsProper evaluation frameworks must include privacy, security, effectiveness, and appropriate engagement levelsTheory of mind remains a significant challenge for LLMs in therapeutic contexts; example: The Sally-Anne Test.Responsible implementation requires staged evaluation before patient-facing deploymentResourcesTo learn more about Adi's research and topics discussed in this episode, check out the following resources:Large language models could change the future of behavioral healthcare: a proposal for responsible development and evaluationTherapist Behaviors paper: [2401.00820] A Computational Framework for Behavioral Assessment of LLM Therapists Cognitive reframing paper: Cognitive Reframing of Negative Thoughts through Human-Language Model Interaction - ACL Anthology Faux Pas paper: Testing theory of mind in large language models and humans | Nature Human Behaviour READI: Readiness Evaluation for Artificial Intelligence-Mental Health Deployment and Implementation (READI): A Review and Proposed Framework Large language models could change the future of behavioral healthcare: A proposal for responsible development and evaluation | npj Mental Health Research GPT-4’s Schema of Depression: Explaining GPT-4’s Schema of Depression Using Machine Behavior AnalysisAdi’s Profile: Adithya V Ganesan - Google Scholar What did you think? Let us know.Do you have a question or a discussion topic for the AI Fundamentalists? Connect with them to comment on your favorite topics: LinkedIn - Episode summaries, shares of cited articles, and more. YouTube - Was it something that we said? Good. Share your favorite quotes. Visit our page - see past episodes and submit your feedback! It continues to inspire future episodes.
    --------  
    42:28
  • LLM scaling: Is GPT-5 near the end of exponential growth?
    The release of OpenAI GPT-5 marks a significant turning point in AI development, but maybe not the one most enthusiasts had envisioned. The latest version seems to reveal the natural ceiling of current language model capabilities with incremental rather than revolutionary improvements over GPT-4. Sid and Andrew call back to some of the model-building basics that have led to this point to give their assessment of the early days of the GPT-5 release.• AI's version of Moore's Law is slowing down dramatically with GPT-5• OpenAI appears to be experiencing an identity crisis, uncertain whether to target consumers or enterprises• Running out of human-written data is a fundamental barrier to continued exponential improvement• Synthetic data cannot provide the same quality as original human content• Health-related usage of LLMs presents particularly dangerous applications• Users developing dependencies on specific model behaviors face disruption when models change• Model outputs are now being verified rather than just inputs, representing a small improvement in safety• The next phase of AI development may involve revisiting reinforcement learning and expert systems* Review the GPT-5 system card for further informationFollow The AI Fundamentalists on your favorite podcast app for more discussions on the direction of generative AI and building better AI systems.This summary was AI-generated from the original transcript of the podcast that is linked to this episode.What did you think? Let us know.Do you have a question or a discussion topic for the AI Fundamentalists? Connect with them to comment on your favorite topics: LinkedIn - Episode summaries, shares of cited articles, and more. YouTube - Was it something that we said? Good. Share your favorite quotes. Visit our page - see past episodes and submit your feedback! It continues to inspire future episodes.
    --------  
    22:42
  • AI governance: Building smarter AI agents from the fundamentals, part 4
    Sid Mangalik and Andrew Clark explore the unique governance challenges of agentic AI systems, highlighting the compounding error rates, security risks, and hidden costs that organizations must address when implementing multi-step AI processes. Show notes:• Agentic AI systems require governance at every step: perception, reasoning, action, and learning• Error rates compound dramatically in multi-step processes - a 90% accurate model per step becomes only 65% accurate over four steps• Two-way information flow creates new security and confidentiality vulnerabilities. For example, targeted prompting to improve awareness comes at the cost of performance. (arXiv, May 24, 2025)• Traditional governance approaches are insufficient for the complexity of agentic systems• Organizations must implement granular monitoring, logging, and validation for each component• Human-in-the-loop oversight is not a substitute for robust governance frameworks• The true cost of agentic systems includes governance overhead, monitoring tools, and human expertiseMake sure you check out Part 1: Mechanism design, Part 2: Utility functions, and Part 3: Linear programming. If you're building agentic AI systems, we'd love to hear your questions and experiences. Contact us.What we're reading:We took reading "break" this episode to celebrate Sid! This month, he successfully defended his Ph.D. Thesis on "Psychological Health and Belief Measurement at Scale Through Language." Say congrats!>>What did you think? Let us know.Do you have a question or a discussion topic for the AI Fundamentalists? Connect with them to comment on your favorite topics: LinkedIn - Episode summaries, shares of cited articles, and more. YouTube - Was it something that we said? Good. Share your favorite quotes. Visit our page - see past episodes and submit your feedback! It continues to inspire future episodes.
    --------  
    37:25

Meer Zaken en persoonlijke financiën podcasts

Over The AI Fundamentalists

A podcast about the fundamentals of safe and resilient modeling systems behind the AI that impacts our lives and our businesses.
Podcast website

Luister naar The AI Fundamentalists, Jong Beleggen, de podcast en vele andere podcasts van over de hele wereld met de radio.net-app

Ontvang de gratis radio.net app

  • Zenders en podcasts om te bookmarken
  • Streamen via Wi-Fi of Bluetooth
  • Ondersteunt Carplay & Android Auto
  • Veel andere app-functies
Social
v7.23.9 | © 2007-2025 radio.de GmbH
Generated: 10/17/2025 - 7:25:33 PM