Explorez tous les épisodes de This Day in AI Podcast
Plongez dans la liste complète des épisodes de This Day in AI Podcast. Chaque épisode est catalogué accompagné de descriptions détaillées, ce qui facilite la recherche et l'exploration de sujets spécifiques. Suivez tous les épisodes de votre podcast préféré et ne manquez aucun contenu pertinent.
Rows per page:
50
1–50 of 103
Date
Titre
Durée
20 Oct 2023
EP37: Fun With PlayHT 2.0, Will Open Source Be Unbeatable? The Future of AI Models + Meta MEG
01:08:23
JOIN DISCORD HERE: https://discord.gg/BA7Rfx69
This week's podcast is an electric shock - new open source AI models like Zephyr are generating content at lightning speed. We dive into the implications of AI on everything from gambling to parenting as new tech lets you clone voices in seconds. Meta can now read your mind and turn brain waves into images, while AI judges sports better than humans ever could. Don't miss our demo of using AI to simulate a full-on financial phishing scam call!
CHAPTERS: ===== 00:00 - AI Chris with PlayHT 2.0 Turbo Cold Open 11:11 - Experience with PlayHT 2.0 Turbo & Thoughts 12:45 - Will Open Source Be Unbeatable? Zephyr 7B Alpha Road Tested 27:54 - OpenAI's Arrakis Failure: Is the Focus Now Small Models? Is Open Source Catching Up? 35:18 - GPT-4V Now Widely Available: Queue the Visual Prompt Injections! 41:03 - DALL-E Prompt Leaks & Yelling at AI in ALL CAPS to Make it Work 44:49 - Stack Overflow Layoffs & Future Disruption from AI: Are we Prepared for What is Coming? 57:12 - Meta's MEG: Reading Our Thoughts & Improve AI Neural Nets by Copying the Human Brain 1:00:16 - Are We Now Desensitized to AI Progress? 1:01:53 - AI Sports Analysis: Will it Change Sport?
Amazon Bedrock & Titan LLM, Generative Agents, AutoGPT, Gambling Update & Joe Rogan Deep Fakes | E10
01:05:41
In Episode 10 of This Day in AI Podcast we discuss Amazon's Bedrock and Titan release, what it means and cover the general availability of CodeWhisperer. We discuss the Generative Agents paper and simulation for predicting human behavior. Chris gives an update on his GamblingGPT and we cover the implications of deep fakes.
====
CHAPTERS: 00:00 - Joe Rogan Deep Fake Cold Open 00:17 - Amazon Bedrock, Amazon Titan LLM and CodeWhisperer News 16:42 - Open Source Dolly LLM from Databricks 19:31 - Costs of training models, latency and reliability of models 23:57 - What does this mean for OpenAI? 30:02 - AutoGPT, AgentGPT Example and AI Agents 34:51 - Generative Agents Paper: AI Simulations to Predict? 45:45 - Chris's GambleGPT Update: Can GPT-4 Vision Help? 51:47 - The future of AI Deep Fakes: Joe Rogan AI Podcast, Fake Kidnapping 1:01:24 - Reddit r/relationship_advice since ChatGPT. AI Agent Finding Dates. 1:03:39 - Germany Considers Banning ChatGPT
====
SUPPORT THE SHOW: If you like this podcast please consider subscribing whereever you listen to your podcasts and leaving a review, it helps others find this podcast and is really appreciated.
EP94: Does Grok 3 Change Everything? Plus Vibes & Diss Track Comparison
01:30:41
Join Simtheory: https://simtheory.ai ---- Grok 3 Dis Track (cringe): https://simulationtheory.ai/aff9ba04-ca0e-4572-84f4-687739c7b84b Grok 3 Dis Track written by Sonnet: https://simulationtheory.ai/edaed525-b9b6-473b-a6d6-f9cca9673868 ---- Community: https://thisdayinai.com ---- Chapters: 00:00 - First Impressions of Grok 3 10:00 - Discussion about Deep Search, Deep Research 24:28 - Market landscape: Is OpenAI Rattled by xAI's Grok 3? Rumors of GPT-4.5 and GPT-5 48:48 - Why does Grok and xAI Exist? Will anyone care about Grok 3 next week? 54:45 - Diss track battle with Grok 3 (re-written by Sonnet) & Model Tuning for Use Cases 1:07:50 - GPT-4.5 and Anthropic Claude Thinking Next Week? & Are we a podcast about Altavista? 1:13:25 - Economically productive agents & freaky muscular robot 1:22:00 - Final thoughts of the week 1:27:26 - Grok 3 Dis Track in Full (Sonnet Version)
Thanks for your support and listening!
05 May 2023
OpenAI Has No Moat, Google's Godfather of AI Quits & The Rise of Open Source LLMs | E13
01:01:53
In Episode 13 of This Day in AI We Discuss the Leaked Internal Google Document, "We Have No Moat, and Neither Does OpenAI", Discuss "The Godfather" of AI leaving Google, Discuss The Implications of Open Source LLM Advancement, Cover Open Assistant's Comparable Model to ChatGPT, OpenAI's Code Interrupter and Touch of Democratizing LLM Alignment.
If you like this episode please consider subscribing and liking the episode to help others discover the podcast.
TIMESTAMPS: ==== 00:00 - Godfather of AI Quits Google Cold Open 00:17 - Google & OpenAI Have No Moat: Advancement of Open Source LLMs 21:13 - 'Godfather of AI' Quits Google 30:54 - Open Assistant & Training LLMs using AI 40:52 - What Increased Prompt Sizes Mean & Long Range Transformers 50:49 - AI Agents, AI Disruption & Mark Zuckerberg's Agent Plan for Meta
8 Things to Know About LLMs, AI Natural Selection, AI Bill of Rights, Meta's Segment Anything | E09
01:13:13
About this episode: This week in Episode 9 of the "This Day in AI" Podcast We Discuss Two Important Papers: 8 Things to Know About Large Language Models and Natural Selection Favors AIs Over Humans. We Cover the Proposed AI Bill of Rights, Controlling AI with AI Constitutions and Mike Talks About His Personal AI Agent AGI Project in GPT-4. We Also Cover Meta's Breakthrough "Segment Anything" Release, AI Prepping, Open Source Models and How AI Has Brought Back the Early Internet Excitement.
If you like this podcast please consider subscribing and leaving a review on your preferred podcasting platform. It helps others discover it and we really appreciate it :).
00:00 - AI could change the world or kill us all 00:26 - 8 Things to Know About LLMs, Natural Selection Favors AIs Over Humans 03:24 - Emergent Skills from LLMs That Can't Be Predicted, Steering AI Models, OpenAI Innovations 06:56 - About AI Constitutions 09:50 - AI Bill of Rights, Will a Constitution Work? AI Natural Selection Discussion 19:13 - Mike's Basic AGI: GPT-4 Writing It's Own Code, Motivating the AI to Code 24:09 - Learnings from GPT-4 Programming 26:49 - BloombergGPT, AI Agents For Enterprise & Government 30:38 - How Do We Get to AGI From Here? AI Lies, Truths and LOLs 38:34 - AI Jibbery: Funny Names for AI Tricking Humans 39:56 - Humanizing the AI: Talking to a Computer 41:03 - Reviewing the Proposed AI Bill of Rights 45:56 - More on AI Pause, AI Job Risk & Can We Slow Down? 51:18 - Prepping for When They Take AI Away From Us: Are We Crazy? 55:55 - Meta AI Segment Anything Release: A Major Sensory Input! 1:04:23 - Is OpenAI's Moat in Trouble From Competing Models? Will OpenAI Stay Relevant? 1:09:35 - More Open Source Models! Koala: Berkeley AI Research 1:11:27 - AI Has Brought Back Early Internet Excitement
Six Month Pause on AI More Powerful Than GPT-4, ChatGPT Plugins, GPT-4 Development, GPT-4 AGI? | E08
01:04:52
Your two favorite luddites are back in Episode 8 of "This Week in AI" podcast. In this episode we cover the FutueOfLife.org Open Letter Asking for a Six Month Pause on AI More Powerful Than GPT-4, Talk About Implementing GPT-4 and It's Limitations, Cover Eliezer Yudkowsky's Doom Predictions in a TIME article, Discuss the Implications of ChatGPT Plugins, MisInformation and A Whole Lot More.
If you enjoy this podcast please consider subscribing and leaving a review to help spread the word. Thanks for listening.
00:00 - Powerful AI is GONNA KILL US ALL!? 00:16 - Six Month Pause on AI? FutureOfLife.org Open Letter 08:35 - Will They Take AI Models Away From Us? 12:30 - Implementing GPT-4 and Limitations of GPT-4 Access 18:07 - Eliezer Yudkowsky Says We're All Gonna Die From Powerful AI! 21:05 - How Will They Legislate AI if They Don't Understand TikTok? 22:06 - Is AI Development as Big as Nuclear Weapons? 23:40 - ChatGPT Plugins: One App To Rule Them All? 32:06 - ChatGPT IS The Killer App: Vector Database (Memory) & Embeddings 39:08 - GPT-4 Can Visualize Images From Imagining Them! 42:59 - Misinformation & Midjourney Access Stopped Due to Deep Fakes 44:45 - ChatGPT Plugins Hacked & DAN Plugin 47:06 - Disrupting OpenAI GPT-4 with Open Source Models 49:24 - Our Dear Friend Simon Willison: LLaMA 7B Model 51:46 - Large Language Models LLMs May Destroy Humanity 52:09 - Do We Know How to Fully Utilize GPT-4? 56:00 - Gambling with AI Update 1:00:21 - Sam Altman Lex Friedman Interview 1:03:00 - We Have #FREESYDNEY Mugs to Give Away!
Support the show by leaving a like, comment or sharing with a friend. We appreciate it!
DESCRIPTION: ======= This week we discuss what's happened since OpenAI's Dev Day: Sam Altman has stopped ChatGPT Plus Subscriptions Due to Demand, GPTs have been leaking their prompts and data, and thousands of people have been busy creating GPTs... but are they any good? We also discuss Microsoft AI Ignite and share our thoughts on Microsoft's new Azure Hardware, Microsoft CoPilot Studio, Azure AI Studio and all the other Microsoft AI Ignite News. We discuss can Open-Source AI Now Compete with GPT-4? And Cover Google Lyria Music AI and Meta's EMU Video and Emu Edit.
CHAPTERS: ======= 00:00 - OpenAI Stops Taking GPT Plus Subscribers. Subscription for sale on eBay 1:30 - Are GPTs Just a New Enthusiasm Phase, The Future or All Hype? 16:28 - Will GPTs just Become Functions and Processes with Proprietary Data? 22:53 - Early GPT Data Leaks & Unsafe Prompts 24:05 - Monetization of GPTs 29:24 - Microsoft AI Ignite: Azure Chips, Microsoft CoPilot Studio, Azure AI Studio 43:41 - Can Open-Source AI Now Compete with GPT-4? 48:27 - The OpenAI Dilemma: Microsoft & Open Source Threats 50:58 - What are The Killer Use Cases for AI? 56:50 - Google Lyria: The Future of Music Creation? 1:02:52 - Meta's EMU Video and Emu Edit AI research milestones.
EP92: o3-mini, Deep Research, Gemini 2.0 Flash & Pro + lols
01:46:27
Join Simtheory: https://simtheory.ai ---- "Don't Cha" Song: https://simulationtheory.ai/cbf4d5e6-82e4-4e84-91e7-3b48cb2744ef Spotify: https://open.spotify.com/track/4Q8dRV45WYfxePE7zi52iL?si=ed094fce41e54c8f Community: https://thisdayinai.com --- CHAPTERS: 00:00 - We're on Spotify! 01:06 - o3-mini release and initial impressions 18:37 - Reasoning models as agents 47:20 - OpenAI's Deep Research: impressions and what it means 1:12:20 - Addressing our Shilling for Sonnet & My Week with o1 Experience 1:20:18 - Gemini 2.0 Flash GA, Gemini 2.0 Pro Experimental + Other Google Updates 1:38:16 - LOL of week and final thoughts 1:43:39 - Don't Cha Song in Full
24 Nov 2023
EP42: What Did Sam Altman Do? Q* & AGI? LLM OS, Claude 2.1, Stable Video Diffusion and Suno Fun!
01:25:46
Join Our Discord: https://discord.gg/58HtZnVD Buy The Merch: https://www.thisdayinaimerch.com/
This week we reluctantly cover all the OpenAI drama and ask What Did Sam Altman Actually Do? Is Q* a path to AGI or just one big "look over here" distraction so we stop asking all these questions... We also cover Andrej Karpathy's LLM OS vision, discuss Claude 2.1 and how bad it's become thanks to "safety" and discuss our initial impressions of Stable Video Diffusion. Finally, we have some fun with Suno!
If you like this podcast, please consider subscribing and liking this episode. We appreciate the support.
CHAPTERS: ==== 00:00 - A Full Recap of What Happened with Sam Altman & OpenAI 10:06 - What Did Sam Altman Actually Do? 28:03 - What Did Ilya Really Discover? Is Q* A Big Distraction? How Far Ahead if OpenAI? 40:47 - Will This Drama Help Progress Open Source AI? 51:11 - Is Andrej Karpathy's LLM OS Vision The Future? 1:00:25 - Inflection-2 LLM 1:02:35 - Stable Video Diffusion Initial Thoughts 1:06:40 - Claude 2.1 Announcement 200K Context 1:21:26 Fun with Suno AI: Make Music with a Prompt
Thanks for helping us reach 2K subs here on YouTube!
This week we dive into the wild world of AI image generation and vision, from racist cartoon captions to heartfelt poetry written by Bing. We discuss the implications of teaching AI to forget unwanted knowledge, and debate whether safety controls are protecting users or limiting creativity. Get ready for philosophical ponderings, hilarious experiments, and our signature irreverent takes as we explore the latest AI advances and absurdities. Whether you're an expert or just fascinated by the future, this episode will challenge your thinking and give you plenty to discuss with friends.
CHAPTERS ====== 00:00 - Fooling Bing Vision to Solve Captcha 00:26 - Meta's Messenger AI Stickers Out of Control! AI Safety Discussion 06:17 - More Safety Nonsense: The Low-Resource Language Jailbreak GPT-4 Paper 9:36 - More on Mistral 7B (Safety and Positive Reception) 17:31 - Friends and Foes of Open Source AI & Is Anthropic a Crypto-like Scam for Billions? 21:26 - Turnitin Thinks It Can Detect AI, Being a Student in an AI World 24:25 - Stable 3B LLM Review and Cheese Test Results 38:48 - DALL-E 3 Road Test on ChatGPT & Diversity Prompt Injection Problems 48:12 - Using Bing GPT4-Vision to Solve Captchas for Grandma 51:01 - The Dawn of LLMs, Explorations with GPT-4Vision Paper + Possibilities of AI Vision 1:04:00 - Who's Harry Potter? Making LLMs forget 1:09:00 - Google Assistant with Bard AI 1:10:18 - LLaMA Long 32K Initial Thoughts 1:12:40 - Sydney Bing is Back BABY! 1:15:36 - Comments on Discord Rollout and Survey Response
Anthropic's 100K, Google IO: AI is Everything, Multimodal AI & AI Girlfriends | E14
01:00:44
Disclaimer: this description was written 100% by Anthropic's AI including the video title and text in the thumbnail. We have not modified it. Thanks Anthropic!
If you like this podcast please consider subscribing, liking the video and leaving us a comment. We really appreciate the support. ----- This week in AI is always packed with the latest news in artificial intelligence, but this episode delivers breakthroughs that are firmly in sci-fi territory. Chris and Michael discuss Anthropic releasing a 100,000-token context window for their language model—enabling it to understand entire novels or every customer record in your database at once. They explore what might be possible now, like AI systems writing convincing news articles, handling legal documents, or providing personalized healthcare recommendations with full medical histories.
We are also awed by Google's new AI capabilities, including models that can translate between any languages, handle complex reasoning or generate code. However, they're concerned by the company forcing it into their search engine and email—noting most users won't appreciate or understand interacting with an AI.
Finally, we dive into an eyebrow-raising use of AI: an influencer selling time with her "virtual girlfriend" chatbot for $1/minute. While concerning, the technology may point to AI companions becoming common for the elderly or lonely. Overall, this mind-blowing episode highlights how AI continues to shape the future at an exponential pace.
CHAPTERS: ==== 00:00 - Introduction 00:35 - Anthropic’s 100k context size announcement 21:58 - Google I/O announcements: PaLm 2, Bison & Makersuite 30:04 - Google Generative AI Search and Workspace Updates 39:37 - Google Search Perspectives 42:01 - Google Bard Vs ChatGPT 45:33 - Hugging Face Transformers Agent & Multimodal models 54:09 - AI model memory and learning 56:41 - The AI girlfriend chatbot
GPT-5? Prompt Injection Attacks, Apple AR AI Platform, Elon Musk Vs Larry Page on AI | E11
01:02:27
In this episode we cover the risks of Prompt Injection Attacks, look at new multi-modal open source models that fill the void from GPT-4 Vision, discuss the 60 Minutes Google Interview and Elon Musk Tucker Carlson Fox Interview, and ask is Apple VR/AR going to be the first true platform where AI becomes a part of our lives?
If you like this podcast please consider subscribing, liking, commenting, sharing and all that stuff that helps spread the word. We appreciate your support!
CHAPTERS: ==== 00:00 - Elon Musk on Larry Page AGI God (cold open) 00:15 - OpenAI not training GPT-5, Enhancing GPT-4 02:28 - WebLLM, Mini-GPT-4, LLaVA & Multi-Model LLMs & OpenAI Training Data Access 15:25 - Heart of My Sleeve: The Weeknd/Drake AI Song Taken Down. 16:48 - Paying for Access to APIs for AI Training Data: Stackoverflow, Reddit, Twitter 17:51 - Prompt Injection Attacks, Phishing, Bots and Risks 32:26 - Google's 60 Minutes Puff Piece. Is AI Search Boring? 35:55 - Elon Musk Fox Interview & Larry Page's Digital God 43:11 - Apple's VR/AR Headset as an AI Platform, Education and Using Siri to Build Worlds with AI 52:38 - Building Robot Brains: LLM as a Robotic Brain
EP33: AI WARS: Gemini Vs Gobi, DALL-E3, Alexa AI, Open Interpreter & Llama2 Experiments
01:05:12
Do you want to join our Discord community? Fill in this: https://forms.gle/k8TyUeWKGWHFBzwQ9.
The AI wars are heating up as Google and OpenAI race to release the first multimodal LLM. We build a sync sub video game with just one prompt using open Interpreter. Alexa shows off scary new conversational abilities, while poets sell out to Big Tech. Join us for the latest AI battles - but don't get your hopes up, it's not that exciting!
If you like the show please consider sharing with friends and leaving a comment.
28 Mar 2024
EP56: We Wrote a Song! Claude Opus is 👑, Gemini 1.5 Pro & Ultra API Experiments
01:26:53
Show notes: https://thisdayinai.com/bookmarks/45-ep56 Try Gemini 1.5 Pro on SimTheory: https://simtheory.ai/agent/865-google-gemini-15-your-ultimate-assistant Try Gemini Ultra on SimTheory: https://simtheory.ai/agent/866-google-gemini-ultra-the-apex-of-ai-conversation Join our community: https://thisdayinai.com
CHAPTERS ===== 00:00 - Fun with Suno v3 10:38 - We Have Google Gemini 1.5 Pro API, Google Ultra API Access! 26:21 - Claude Opus is the King According to LMSYS Chatbot Arena Leaderboard 38:25 - The Sink Sub Coding Challenge with Opus, Gemini 1.5 Pro and Gemini Ultra + Building Salesforce CRM with AI 50:06 - Amazon Invest More Billions in Anthropic 53:03 - Hume AI: Empathic AI Voice & Vision Understanding 1:01:06 - Inflection AI Absorbed into Microsoft, Microsoft is below, above and around all top AI labs. 1:09:28 - Does AI Help Students Learn? Maybe Not? 1:17:37 - Stable Code Instruct 3B, a good local coding model? 1:23:12 - Our AI Songs in Full!
Thanks for listening, please consider subbing, liking, commenting - we love hearing from you.
01 Dec 2023
EP43: Is GPT-4 Lazy? Wizard 33B, Qwen 72B Tested & Self Operation AI Computer
01:12:36
Join the discord: https://discord.gg/27mQ9cut Get the merch: https://thisdayinaimerch.com
This week we celebrate ChatGPT's 1 Year Anniversary and Ask is GPT-4 Lazy? We explore the best of open source with Wizard 33B and test China's Qwen 72B model from Alibaba. Chris tries to delete all files from his computer using Self Operation AI Computer and we cover Amazon's AWS Ignite AI announcements, Stability Diffusion XL Turbo, The Scalable Extraction Attack on ChatGPT and an exciting waitlist release from PIKA.
Like, sub, comment if you enjoy the episode to support the show. We love hearing from you.
CHAPTERS: ===== 00:00 - Cold Open 00:08 - ChatGPT 1 Year Anniversary 07:54 - Is GPT-4 Lazy? Is Claude Unusable Now? 18:43 - Are Open-Source Models Catching Up 1 Year On? 21:57 - Wizard 33B Open-Source Model 24:55 - Demo of Wizard 33B 28:26 - China's Qwen 72B Open-Source Model 31:26 - Qwen Demo 38:16 - Self Operation Computer Discussion & The Future of AI With Access to Computers 49:23 - Scalable Extraction: DeepMind's COMPANY attack to extract training data from ChatGPT 55:20 - Stability Diffusion XL Turbo, Stability's Stability & Commercial Subscriptions 1:03:23 - Amazon's AWS Ignite: Amazon Q, Trainium 2, Bedrock Fine Tuning 1:07:49 - PIKA Video 1:09:26 - Important News
Nvidia & The AI Goldrush, Neuralink Human Trials, LLM Specialization & Meta's MEGABYTE | E16
01:01:43
Neurolink's Brain Chip Gets FDA Approval, NVIDIA Stock Skyrockets, and OpenAI wants to regulate everyone but themselves. We discuss LLM specialization and fine-tuned models like LIMA, Gorilla and ask is less training data more? We also discuss whether we would get the Neuralink brain implant. The future is arriving fast in AI this week and we try our best to keep you up to date.
---- If you like this podcast help us spread the word. Please consider leaving a comment, liking and subscribing if you haven't already. ----
00:00 - NVIDIA, Trillion Dollar AI Company? 08:42 - Neuralink gets FDA approval for Human Trials 15:30 - Can AI Read Your Thoughts? We Discuss Mind-Video 20:12 - Sam Altman's World Regulation Tour 23:25 - Governance of Superintelligence and Democratic AI 29:16 - Meta's MEGABYTE: The Future of AI? 34:36 - Specialized LLMs: Gorilla, LIMA 48:34 - Adobe's Generative Fill in Photoshop 54:23 - Microsoft CoPilot for Windows 11: Will it be Safe from Prompt Injection? 57:42 - AI Training use Game Mechanics & More of AI Gaming
EP32: Does AI Remember Your Unethical Requests? Chuck's AI Forum, Robot Ethics, & LLM Deception
01:06:29
This week's episode is an absolute barnstormer, covering everything from robots burning in stadium fires to AI girlfriends with dangerous memories. Get ready for an action-packed ride as we dive into the dark realities of AIs keeping naughty lists, journalism being taken over by plagiarizing robots, and whether downloading your brain into an android body means you can laugh in the face of death. Buckle up and grab some popcorn, because this week's episode is one wild ride from start to finish!
(Written by AI lol)
If you like the pod please support us by leaving a review wherever you get your podcasts and sharing with friends.
CHAPTERS ==== 00:00 - "What if I could download your soul?" Cold Open 00:56 - Chuck's AI Forum, Regulation and What We Should Be Focusing On 11:02 - Deceptive Abilities Emerging in LLM Paper Discussion 24:03 - Large Language Models and Optimizers: Take a Deep Breath 30:50 - 5 Years to Discover Capabilities of Current Models 33:52 - a16z Report on How Consumer are Using LLMs 39:41 - Are Your Androids Going to Be Criminals? Implications of AI Robots in Society 47:25 - US Copyright Offices Denies AI Created Image Copyright & Microsoft Will Legally Defend Paid Users of AI CoPilot 55:27 - Stable Audio: Mike's Paid Customer Stable Audio Experience 59:48 - Open Interrupter: Open-Source Version of OpenAI's Code Interrupter 1:02:22 - ChatGPT Journalist Leaves Prompt in Article. LOLs.
EP36: ChatGPT Vision Road Tested, AutoGen Cheese Test & Anthropic's Break Through
01:12:46
Join the discord: https://discord.gg/bb6VZHks
This mind-blowing episode explores the shocking capabilities of GPT-4 vision, including how it identified Mike's exact location just from a traffic photo. We dive into the cheese-filled insanity of using multiple AI agents together with AutoGen, and discuss Anthropic's groundbreaking research into neural superposition. Don't miss our dramatic exposé of Meta's new lobotomized AI chatbots - this episode takes you on a wild ride through the cutting edge of AI!
If you like this episode please consider subscribing and leaving a comment or review.
CHAPTERS ===== 00:00 - Shakespearean AutoGen DRs 00:29 - ChatGPT Vision Road Tested, Augmented Intelligence & Comparison to LLaVA Vision 31:25 - The Cost of AI: Is There A Business Model With Margin? 36:52 - The Value of AI is in Productivity: Discussion of Business Models 43:19 - AutoGen Agents Cheese Test: Do Multi-Agents Perform Better? 58:09 - Anthropic's AI Break Through: Decomposing Language Models Into Understandable Components
EP97: Moore’s Law for AI agents, OpenAI's new audio models, o1-pro API & When Will AI Replace Us?
01:37:18
Create an AI workspace on Simtheory: https://simtheory.ai --- Song: https://simulationtheory.ai/f6d643e4-4201-475c-aa82-8a96b6b3b215 --- CHAPTERS: 00:00 - OpenAI's audio model updates: gpt-4o-transcribe, gpt-4o-mini-tts 18:39 - Strategy of AI Labs with Agent SDKs and Model "stacks" and limitations of voice 25:28 - Cost of models, GPT-4.5, o1-pro api release thoughts 31:57 - o1-pro "I am rich" track & Chris's o1-pro PR stunt realization, more thoughts on o1 family 48:39 - Moore’s Law for AI agents, current AI workflows and future enterprise agent workflows & AI agent job losses 1:24:09 - Can we control agents? 1:29:21 - Final thoughts for the week 1:35:15 - Full "I am rich" o1-pro track --- See you next week and thanks for your support.
CORRECTION: Kosciusko is obviously not an aboriginal name I misspoke. Wagga Wagga and others in the voice clip are and are great ways to test AI text to speech models!
06 Nov 2023
LIVE: Reaction to OpenAI DevDay, Opening Keynote
01:17:40
This is a recording of the live event on YouTube following the OpenAI DevDay keynote. We'll be back with a regular episode later this week.
Sharkey and Sharkey amped up on caffeine live react to OpenAI's latest announcements. Cost reductions, larger models, and an app store?! The duo banter and bicker about whether this marks excitement or irrelevance for devs like you. Plus Elon Musk teases a GPT-style model without the handcuffs - does this spell trouble for Big Sam? Sharkey and Sharkey think out loud and solicit hot takes from listeners on the implications.
EP79: Fun with ChatGPT Advanced Voice Mode & Which Models Do People Actually Use?
01:12:48
Join Simtheory: https://simtheory.ai Community: https://thisdayinai.com ----- Thanks for listening and all of your support of the show! ----- CHAPTERS: 00:00 - Fun with ChatGPT Advanced Voice Mode & Moshi 04:11 - Thoughts on Advanced Voice Mode, Voice Mode API & Voice as an Interface 29:31 - We Share Simtheory.ai Model Usage Data: Forget Benchmarks... Which Models Do People Actually Use? 38:35 - Llama 3.2 with Vision: Thoughts on New Models and Llama Stack 55:02 - Google Gemini 1.5 Pro 002 Update: Thoughts on New Model 1:04:56 - OpenAI achieves AGI and Fires All Executives 1:08:06 - Mike's Weekly LOL
07 Jul 2023
Aligning Super Intelligence & The Open Web's Uncertain Future + GPT-4 General Availability | EP22
01:08:07
In Episode 22 we cover GPT-4 general availability and the implications to millions of developers. We discuss George Hotz revealing how GPT-4 works & Alignment of Super Intelligence. Will OpenAI's Super Intelligence succeed? We also cover the implications of AI agents on the open web, OpenAI disabling web browsing in ChatGPT temporarily to protect content creators and discover Jolly Roger's AI time wasting for service to trick scammers.
If you like this podcast please consider subscribing, liking and leaving us a comment. We really appreciate your support of the show.
CHAPTERS: ---- 00:00 - Cold open 00:27 - GPT-4 API General Availability for Developers 06:40 - ChatGPT Code Interrupter 15:30 - ChatGPT as Work Assistant: Code Interrupter + Vision 16:42 - ChatGPT web traffic down by 10% & thoughts on single agent Vs AI everywhere 21:03 - George Hotz leaking GPT-4 is a 16 way mixture model 26:50 - Is AI alignment just a human alignment problem? 27:57 - Is alignment making GPT-4 and ChatGPT worse? Discuss on AI alignment 38:18 - Is the best defense against super intelligent AI giving everyone super intelligent AI? 40:55 - Is the Open Web Doomed? AI's impact on the Open Web 54:42 - OpenAI's 20% compute power to "Superalignment" 1:00:29 - OpenAI now sends email threats 1:03:12 - Waste Scammers Time with This AI Voice Tool
Is Bing's Sydney Still Unhinged? Amazon Multimodal, AI a Threat to Humanity? OpenAI Foundry | E03
01:01:06
In episode #03 of the This Day in AI Podcast we talk about Microsoft's Updates to the Bing ChatBot and Discover if Sydney is Still Unhinged. We Cover New Models like Amazon's Multimodal, Discuss AI Memories and How Soon Before AI is a Threat to Humanity. Learn About OpenAI Foundry, RunwayML and AI in the Enterprise thanks to OpenAI's Partnership with Bain & Company.
00:00 - AI a Threat to Humanity? 01:05 - Is Bing Chatbot "Sydney" still Unhinged? 03:03 - Bing's Response to Kevin Roose 06:50 - Is Bing Chatbot Evil? 8:06 - Is the Internet Bing AI Chatbots Memory? Will AI develop Memory? 10:15 - Bing's Sydney Chatbot Making Threats Again 13:34 - Is AI Unhinged Because of Social Media? 14:14 - Prompt Injection Cat & Mouse 14:52 - Is ChatGPT Sentient? Learning with Prompts 17:26 - The Skill of Prompting AI and What is Coming 20:00 - AI Wars: Bing Vs Bard, Is Google Seriously In Trouble? 24:36 - Amazon AI Model: Multimodal Chain-of-Thought Reasoning 31:51 - OpenAI Foundry. What is AWS thinking? Google Cloud AI? 37:26 - Open AI Partnership with Bain & Company. AI in the Enterprise. 41:05 - Will All Neural Net Models Join to Create a Super AI? 43:35 - How Long Until AI is a Threat to Humanity (Continued) 47:10 - How will Enterprise AI handle hallucinations? 49:58 - Influencing AI Through Training Data. AI SEO? 53:25 - RunwayML: Zero Budget CGI for Film Makers 56:20 - Microsoft BioGPT: Specialized AI Models 59:15 - BasedGPT Prompt Injection: How would you take over the world?
Create a Simtheory workspace: https://simtheory.ai Compare models: https://simtheory.ai/models/ ------ 3d City Planner App (Example from show): https://simulationtheory.ai/8cfa6102-ed37-4c47-bc73-d057ba9873bd ------ CHAPTERS: 00:00 - AI Fashion 01:13 - Gemini 2.5 Pro Initial Impressions: We're Impressed! 38:24 - Thoughts of Gemini distribution and our daily workflows 55:49 - OpenAI's GPT-4o Image Generation: thoughts & examples 1:13:52 - Gemini 2.5 Pro Boom Factor 1:18:38 - Average rant on vibe coding and the future of AI tooling ------ Disclaimer: this video was not sponsored by Google... it's a joke.
Thanks for listening!
21 Jul 2023
Llama 2 Unleashed, Open-Source Vs OpenAI, Instructions for ChatGPT & Fine Tuning with Llama2 | EP24
01:16:09
In Episode 24 we discuss the release of Llama 2, what it means for the open-source community, developers and the future of AI LLMs with it being commercially available. We discuss what threat open-source models now pose to the business models of OpenAI and others, and also take a look at Instructions for ChatGPT. We have a long discussion on fine tuning smaller models and cover news including Google's Generative AI Enterprise Search product, Bing Enterprise Chat, AppleGPT rumors and finally end with the state of AI startups, fundraising, VC and what makes good AI investment.
If you like this podcast, please consider leaving us a review and sharing with friends/family.
CHAPTERS:
00:00 - Lizard Man is now a Hero (cold open) 00:49 - Meta’s Llama 2 release and what it means 21:18 - Llama 2 alignment, Is fine tuning just censorship? 27:02 - What is Meta’s strategy with Llama 2? Will it hurt OpenAI? 31:56 - OpenAI fears GPT-4 Vision will offend someone 40:40 - ChatGPT Plus Users get GPT-4 Rate Limits Doubled 41:56 - ChatGPT Custom Instructions 46:15 - Does Llama Threaten OpenAI with Smaller Refined Fine Tuned Models? 54:39 - Google’s Generative AI Enterprise Search (Gen App Builder) Announcement 56:16 - Microsoft’s Bing Chat Enterprise 1:02:19 - AppleGPT “AJAX” rumor 1:05:49 - The State of AI Startups, VC and fundraising 1:12:32 - AI LOLs & Memes
EP58: We Convinced a Record Label to Sign an AI Artist + Udio AI Music, Gemini 1.5 Pro, GPT-4 TURBO, Mixtral
01:09:53
AI News: https://thisdayinai.com SimTheory: https://simtheory.ai Show Notes: https://thisdayinai.com/bookmarks/48-ep58 -------
CHAPTERS: 00:00 - Udio, Udio Examples 10:45 - Will a Record Label Sign an AI Udio Artist? 19:09 - 3 Major LLM Updates/Release in a Single Day 22:58 - Google Gemini 1.5 Pro General Availability, Audio Modality & Impressions 30:20 - Google Cloud Next 2024 AI Announcements Discussion 47:18 - OpenAI Announces "improvements" to GPT-4 Turbo, GPT-4 Turbo Official Release & Vision API JSON & Function Calling 57:35 - Mistral Posts BitTorrent To New Open Source Model Mixtral-8-22B 1:03:00 - Humane's AI Pin Reviews are out... and they aren't great.
Special thanks to AI artist Conor for the great content!
Thanks for listening.
15 Mar 2024
EP55: Will Devin Take Our Jobs? Sora Interview, Claude Haiku, DeepSeek 7B, Figure1 & Robot Slavery
==== CHAPTERS 00:00 - OpenAI CTO Mira Murati Sora Interview Train Wreck 16:47 - EU Passes the AI Act 24:25 - 1 year since Greg Brockman Unveiled GPT-4 + Cognition's Devin 52:34 - Anthropic Releases Claude 3: Haiku & It's REALLY GOOD! 1:05:20 - DeepSeek-7B Real World Vision Language Understanding 1:16:09 - It's all about the training data, why Tesla might win Robotics & Vision 1:17:27 - Figure1 Robot with OpenAI for Vision and Language + Discussion on Robot Slavery ====
Please consider subscribing if you like the podcast! Thanks for listening.
Try SimTheory Beta: https://simtheory.ai/chat Show Notes: https://thisdayinai.com/bookmarks/63-ep70 Join our community: https://thisdayinai.com Merch: https://www.thisdayinaimerch.com/ ===== Thanks for listening! ===== CHAPTERS: 00:00 - It's good to be back... 04:12 - Chris's Learnings From Playing Poker Using AI 32:11 - Initial thoughts on GPT-4o Mini from OpenAI 44:15 - Mistral's NeMo 55:01 - Codestral Mamba 1:04:48 - MathΣtral: Scientific Discovery or BS? 1:12:59 - New Models on LMSYS: Column-r Column-u, Eureka by Google 1:09:22 - BOOM FACTOR for new models 1:16:39 - JD Vance Doesn't Want AI Regulatory Capture 1:18:41 - Final thoughts
03 May 2024
EP61: What is GPT2-chatbot? MoE Theories, ChatGPT Search, Virtual Try On & Fine-Tuning Experts
01:23:05
Show Notes: https://thisdayinai.com/bookmarks/53-ep61 Community: https://thisdayinai.com SimTheory: https://simtheory.ai
Thanks for watching, if you like the show please consider subscribing, liking and all the stuff lord youtube requires.
CHAPTERS: ---- 00:00 - GPT2-chatbot: What could GPT2 Be? Is This GPT4.5 or GPT-5? 37:08 - Is OpenAI about to take on Google & Perplexity with Search? ChatGPT Search? 52:15 - Fun with Virtual Try On: IDM-VTON 1:01:30 - Anthropic Releases Claude App for iOS & Claude Teams. Should you lock your team to a single model? 1:08:37 - GeoSpy AI Hype & reality check 1:15:21 - World's First AI Music Video Using OpenAI's SORA
31 Oct 2024
EP83: Self Driving Computers, plus SearchGPT, & Github Copilot with Sonnet
01:03:30
Join Simtheory: https://simtheory.ai Community: https://thisdayinai.com === CHAPTERS: 00:00 - What happens with we have Self Driving Computers? 33:00 - ChatGPT Search Comparison & Thoughts 48:47 - Github Copilot Goes Multi-Model with Sonnet 3.5
Thanks for listening!
17 Jan 2024
EP47: GPT-5 Rumors, AutoGen Studio, SeeAct Web Agents, Google AMIE, Anthropic’s Sleeper Agents
DESCRIPTION ==== In this episode, we dive into the buzz around GPT-5, sparked by Sam Altman's revelations on Bill Gates' latest podcast. We share our top hopes and dreams for GPT-5 and future AI advancements. Next, we delve into Microsoft's new CoPilot Pro Subscription, exploring how it stands out from ChatGPT Plus. Chris takes AutoGen Studio for a spin and ponders over its ideal user base. The episode then shifts to the intriguing concept of collaborative AI agents - is this the path to AI's mastering reasoning, reflection, and profound thought? We dissect the insights from the SeeAct Web Agents study, assessing its influence on AI agent development. Shifting gears, we discuss Google AMIE's groundbreaking ability to outperform doctors in diagnoses, even those assisted by AI. To wrap up, we spotlight the significance of Anthropic's Sleeper Agents experiment and its groundbreaking findings.
Thanks for listening. Please consider subscribing if you haven't already and leaving a review. We appreciate all of your support!
CHAPTERS: ==== 00:00 - Cold Open 00:31 - GTP-5 Rumors & Leaks 07:32 - Microsoft CoPilot Pro 22:27 - Microsoft's AutoGen Studio: An open-source UI for AutoGen 38:53 - The Future of AI Agents? LAMs and SeeACT Web Agent Paper 1:00:19 - Google AMIE: Can AI Replace Doctors for Diagnosis? 1:13:12 -Anthropic's Sleep Agents Experiment
EPo99.02-experimental: OpenAI's Gaggle of Models: o3, o4-mini & GPT-4.1 & Future GPT-5 Systems
01:31:06
Join Simtheory: https://simtheory.ai like and sub xoxox ---- 00:00 - Initial reactions to Gaggle of Model Releases 09:29 - Is this the beginning of future GPT-5 AI systems? 47:10 - GPT-4.1, o3, o4-mini model details & thoughts 58:42 - Model comparisons with lunar injection 1:03:17 - AI Rap Battle Test: o3 Diss Track "Greg's Back" 1:08:12 - Thoughts on using new models + Gemini 2.5 Pro quirks 1:10:54 - The next model test: chained tool calling & lock in 1:14:43 - OpenAI releases Codex CLI: impressions/thoughts 1:18:45 - Final thoughts & help us with crazy presentation ideas ---- Links from Discord:
EP73: Has Google Done It? Grok 2 Beta & Is Tuning All You Need?
01:24:54
Sign up to Simtheory: https://simtheory.ai ------- 00:00 - Reactions to #madebygoogle and Gemini Live 15:30 - Grok 2 Beta Tested & Is Grok Getting Flux Credit? 39:03 - Future of Personalized Software in Education & The Workplace: Are Devs Still Needed? 1:02:16 - Claude's Prompt Caching Explained 1:11:18 - Hermes 3 (Llama 3.1 Fine-tuned for instruction following) 1:19:18 - Is Tuning All You Need? Why Claude Sonnet 3.5 is so good.
Thanks for watching/listening/subscribing/liking/commenting and reviewing our average podcast each week. It means a lot to us.
You can join our community here: https://thisdayinai.com or try Simtheory: https://simtheory.ai.
23 Aug 2024
EP74: Human Eggs with Ideogram 2.0, Phi 3.5 Boom Factor + AI-Free Startups
01:13:09
Sign up to Simtheory for an AI workspace: https://simtheory.ai Try ideogram 2.0 on Simtheory --- CHAPTERS: 00:00 - Ideogram 2.0: Your new AI graphics designer? 23:46 - Microsoft Phi 3.5 Initial Impressions & Thoughts + Boom Factor 38:51 - AI workspace productivity: how much is your productivity worth? 55:08 - Procreate's Anti AI Movement: Marketing or a New Category? 1:07:06 - Chris's thoughts on Phi-3.5 Fine Tuning & Lack of Documentation, Accessibility of Models to Try --- To see images from the show join our Discord community: https://thisdayinai.com
Show notes: https://thisdayinai.com/bookmarks/68-ep74
Thanks for listening, your comments, reviews and support of the show. We really appreciate it and love hearing from you.
PS. Tasmanian YouTuber Chris mentions: https://www.youtube.com/@UCalOFVbIxEAWIV5LHGkKcnw
04 Aug 2023
EP26: Software Teams Replaced for $1.40, Doctors Out-Diagnosed, Meta's Audio Craft, ChatGPT Updates
01:09:48
This week we dive into the brave new world of AI agents teaming up to do real work - from building video games to diagnosing patients! But will these digital workforces put humans out of jobs? We discuss the AI takeover of industries like medicine and software, plus exciting updates like AI-generated music and Google giving their Assistant a complete AI makeover.
We also cover Meta's Audio Craft, Med-Flamingo, GPT-5 Trademark and Rumors, and The SF Compute Company.
Thanks for your likes, comments and support.
CHAPTERS: ===== 00:00 - Self-Diagnosing Medical Problems Is Here 00:28 - MetaGPT: Multi-agent Collaborative Framework and The Multi-Agent Future 11:39 - Will Agents Replace Software in the Future? 14:59 - The flood of new LLMs 20:09 - Med-Flamingo: Have Virtual Doctors Arrived? 34:06 - Martin Shkreli's Dr. Gupta & Disrupting Medical Diagnosis 38:45 - Everyone is Using AI Already for Medical Diagnosis 39:29 - Focusing on Higher Level Work 40:53 - Meta's Audio Craft: Create Sounds and Music with Open Source AI 44:53 - Will Spotify Cut Out Artists to Increase Profits with AI Music? 48:07 - Will Entrenched Professionals Slow Down the Benefit of AI? 51:37 - ChatGPT Updates: GPT-4 as Default, Suggested Replies, Prompt Examples, Stay Logged In! 57:05 - The San Francisco Compute Group: The A100 Cooperative! 1:00:42 - GPT-5 Trademark has been registered! 1:01:48 - Google Assistant Powered by AI LLM Leaked in Letter 1:05:02 - The Future of Websites: LLMs for Businesses and Brands
EP28: What is Poop? Is Generative AI a Dud? Will OpenAI Go Bankrupt? + Llama2 Uncensored
01:09:25
This week your favorite AI bros go deep on the BIG LIES - how big tech and the mainstream narrative are trying to SILENCE revolutionary AI models that threaten to EXPOSE inconvenient truths! Tune in as Chris and Michael SHRED Meta's attempt to gag their new science AI Galactica, and discuss the CENSORSHIP built into aligned models like Claude and GPT-4. Don't miss their hilarious takedown of noted AI alarmist Gary Marcus - his latest flip-flop proves generative AI is HERE TO STAY! The truth will not be suppressed!
(Note description written by AI for lols)
Thanks for helping us reach our goal of 100+ reviews on Apple Podcasts. It means a lot to us!
CHAPTERS ===== 00:00 - What is a Poop? 01:08 - Is Generative AI a Dud? 23:32 - OpenAI Acquires Global Illumination to work on ChatGPT 31:12 - Anthropic Raises $100M from Korean SK Telecom 37:15 - LLAMA2 Uncensored: Censorship, Misinformation and the Battle for Truth 48:31 - Meta's AI Trained on 48M Science Papers Shut Down After 2 Days
Thanks for listening and all of your support, we appreciate it.
-------- CHAPTERS: 00:00 - Google's AI Overview Fails 18:50 - The Reality of Using AI 25:02 - Two weeks of GPT-4o 34:36 - Microsoft Build: CoPilot+ PCs, Recall, AI Narks, Phi-Silica, Team CoPilot, CoPilot with GPT-4o Voice 54:41 - Phi-3-Vision Testing 57:26 - Mistral-7B v0.3 Uncensored with Function Calling Testing 1:08:55 - AI Startup Bubble? Lots of AI Startups looking for buyers... 1:15:37 - Help us get this song to #1 on Udio!
11 Apr 2025
EP99.01: Google Cloud Next, Agent2Agent, MCPs, Agent Development Kit, Is Llama4 a flop? & Grok API
01:42:45
Join Simtheory: https://simtheory.ai -- Get the official Simtheory hat: https://simulationtheory.ai/689e11b3-d488-4238-b9b6-82aded04fbe6 --- CHAPTERS: 00:00 - The Wrong Pendant? 02:34 - Agent2Agent Protocol, What is It? Implications and Future Agents 48:43 - Agent Development Kit (ADK) 57:50 - AI Agents Marketplace by Google Cloud 1:00:46 - Firebase Studio is very broken... 1:06:30 - Vibing with AI for everything.. not just vibe code 1:15:10 - Gemini 2.5 Flash, Live API and Veo2 1:17:45 - Is Llama 4 a flop? 1:27:25 - Grok 3 API Released without vision priced like Sonnet 3.7 --- Thanks for listening and your support!
02 Nov 2023
EP39: White House AI Executive Order, The Bletchley Declaration & Adversarial AI Attacks
01:06:59
Join our Discord: https://discord.gg/TRrgAyeM Buy the merch: https://www.thisdayinaimerch.com/
This week the AI guys unpack the White House's sweeping executive order on regulating AI - will this lead to the death of open-source models? They also discuss the vague and fluffy Bletchley Declaration signed by world leaders, why Geoffrey Hinton just won't stop fearmongering, and introduce some hilarious new merch including a life-size shower curtain! Tune in for hot takes on the AI ethics debate, prompt engineering tricks, and key insights on the future of language models.
CHAPTERS: ===== 00:00 - King Charles on AI (Cold Open) 00:20 - Thoughts on White House AI Executive Order 23:09 - The Bletchley Declaration & AI Safety Summit 38:04 - LoRA Fine-tuning Efficiently Undoes Safety Training in Llama 2 & They Killed Tay! 48:34 - Adversarial Attacks and Defenses in Large Language Models: Old and New Threats Paper 51:51 - Mike proposes What The Future of AI Computing Might Look Like 55:00 - Leaked: The Secret Prompt Powering ChatGPT's New Multi-Tool Mode (and How to Hack It)
1:01:39 - Anthropic Have Raised More Billions & Our Merch Store!
EP90: Consoles of the Future, Deepseek-R1, Gemini-Thinking, Create with Code, Operator & Stargate
01:18:03
Join Simtheory and use Create with Code: https://simtheory.ai Community: https://thisdayinai.com ----- VOTING: Code challenge winner: https://simulationtheory.ai/e2875acf-33c3-4a96-97a1-14b8d815a49a BOOM factor on Deepseek: https://simulationtheory.ai/f98a591d-7726-42d3-afaf-53b8c29fa21e ----- CHAPTERS: 00:00 - Computer Use, Agents, Collaborators Discussion 37:12 - Deepseek-R1 Thinking Model on Par with o1/Claude Sonnet 43:59 - Google Gemini Flash Thinking Experimental Updated 48:18 - Code Challenge: Deepseek R1 Vs Gemini Flash Thinking Vs o1 Vs Sonnet 1:02:37 - OpenAI Announces Stargate Project 1:10:10 - Deepseek-R1 BOOM FACTOR ------ Thanks for listening.... plz like, comment, sub.
13 May 2024
LIVE: OpenAI Spring Event (Post Event Reaction)
00:59:27
LIVE after the OpenAI Spring Update Event! Hear our initial reaction to GPT-4o and "Her" like virtual assistant with low latency voice.
More testing/discussion coming later in the week.
Community: https://thisdayinai.com.
02 Jun 2023
Will AI Make Humans Extinct? Did GPT-4 Get Worse? & OpenAI's GPT-4 Roadmap Revealed | E17
01:05:16
This week we cover the Center for AI Safety's Statement and ask, "Will AI Really Make Humans Extinct?" We cover the discussion on GPT-4 Deteriorating: is it really getting worse? And discuss some likely causes.
We also use the Internet Archive to discuss OpenAI's GPT-4 roadmap including a stateful API, giant context sizes, plugins having no product market fit and why OpenAI is constrained by GPUs.
We also cover the lawyer who got caught using ChatGPT and the problems with hallucinations. Will step-by-step rewards help solve these problems? Do we need better warnings for stupid lawyers?
If you like this episode please consider subscribing, comment and liking to help others find our podcast. We appreciate your support.
---- Chapters: ---- 00:00 - Is GPT-4 Getting Worse? 00:20 - Center for AI Safety Statement: Will AI Make Humans Extinct? 12:09 - Is GPT-4 Getting Worse? Is Alignment the Problem? 29:27 - GPT-4 Roadmap: 1M Context? Multi-modal? 36:11 - Vision of ChatGPT: Have Plugins Failed? 38:02 - Will OpenAI API's Have Too Much Competition? 40:21 - Giant AI Models Aren't Over: Is GPT-4 Much More Capable? 44:39 - ChatGPT for Lawyers: What Could Possibly go Wrong? 48:17 - The Hallucination Problem 53:22 - Process Supervision: Solution to Hallucination? 55:59 - Japan Removes Copyright Laws for AI Progress 57:53 - UAE's Falcon 40B Open Source & Royalty Free! 1:01:04 - Will We Value Written Language?
Inflection's $1.3B AI, OpenAI Competition, MidJourney Zoom, Fun with GPT-4 Vision | E21
01:06:45
In Episode 21 we discussed Inflection AI’s $1.3B fundraise from Greylock, Microsoft, Bill Gates, Eric Schmidt and the increasing competition to OpenAI’s dominance. We discuss the makeup of future AI Agents and look at the emerging race to build the world’s most powerful AI.
We also cover GPT-4 Vision LOLs and ask if Sydney is back with Bing’s GPT-4 Vision Implementation and discuss the latest release from MidJourney, Stable Diffusion and why Nikon is begging people to take REAL photos!
CHAPTERS --- 00:00 - LOLs from GPT-4 Vision on Bing Chat 00:39 - Inflection AI's $1.3B Fundraise (and Pi) & The Race to Build the World's Most Powerful AI 11:07 - Divergent Future Visions of AI Agents/Assistants & ChatGPT Workspaces Leak 17:06 - Future AI "stacks", Custom LLM Training as a Service, Comparing LLMs 28:10 - MosaicML Acquired by DataBricks for $1.3B & Private LLM training, Agents Creating Specialized LLMs 34:00 - Are LLM APIs becoming commoditized? Is the ChatGPT brand and distribution the most valuable thing? 40:56 - How Much Context? Building the AI Brain & Aligning it to Your Values 46:27 - Stable Diffusion XL, MidJourney Zoom Out & Nikon's Desperation 54:36 - Bing's GPT-4 Vision Implementation LOLs: Is Sydney Back? 58:27 - Is AI Going to Destroy The Internet and Become Stupid? Spam sites, Crawlers, AI-Generated Book Spam on Amazon 1:02:00 - OpenAI ChatGPT Lawsuit over Data Use: Class Action?
01 Mar 2024
EP53: Mistral Large, Forecasting with LLMs, The Gemini Pile On & Is CoPilot Using GPT-4.5?
This week we talk about the release of Mistral's Large model, Mistral Le Chat, and their deal with Microsoft Azure. We cover papers on Emote Portrait Alive, AI Lip Reading and Cover the Gemini Pile On and how it is distracting from Gemini and the 1M context size break through. We cover the great "data sale" of both Reddit, Tumblr and Stackoverflow data and discuss the Forecasting with LLM paper from Berkeley. We also cover Klarna's 700 support agent replacing AI agents and ask... is Sydney Back with GPT-4.5?
====
CHAPTERS: 00:00 - Cold open 00:44 - A Tough Week for AI Influencers 02:29 - Mistral Large, Mistral Le Chat & Microsoft Azure Partnership 30:31 - EMO: Emote Portrait Alive 36:26 - VSP-LLM: Visual Speech Processing incorporated with LLMs. AI Lip reading tech. 40:06 - The Google Gemini Pile On / Backlash: Is it taking attention away from 1M context breakthrough? 55:25 - The Great AI Training Data Sale: Reddit, Tumblr, Stackoverflow 1:00:34 - Forecasting with LLMs Paper: Can AI Predict The Future? 1:10:15 - Klarna Says They Replace 700 Humans with AI 1:18:07 - Is Microsoft's CoPilot Update Really GPT-4.5?
====
If you like the podcast please consider subscribing, comment, liking and all the things required to feed the YouTube overlords.
30 Aug 2024
EP75: OpenAI🍓, Q* & Orion: What Will Happen When AI Has Agency?
01:13:22
Get a Simtheory AI Workspace: https://simtheory.ai Show Notes: https://thisdayinai.com/bookmarks/69-ep75 ------ 00:00 - Lols 00:29 - Discussion on OpenAI's Strawberry Q* and Orion Leaks and What it Might Mean for the Future of AI Agency & Background Tasks 31:48 - Google's New Gemini 1.5 Pro & Flash Experimental Tunes: Our Thoughts 44:22 - Google's Diffusion Models are Real-Time Game Engines GameNGen & Future Model Simulations 58:06: Qwen2-VL Vision Models: Initial Thoughts 1:08:00 - Some LOLs & Surprise End of Show Guest! ---- Thanks for listening and your "average" reviews. It means a lot to us. To support the show please consider leaving a review, like, comment and all the things.
19 Apr 2024
EP59: Unhinged Meta Llama 3 *Special Edition*
01:26:33
Show Notes: https://thisdayinai.com/bookmarks/51-ep59 SimTheory: https://simtheory.ai This Day in AI Community: https://thisdayinai.com
CHAPTERS: ====== 00:00 - Meta Llama 3: Chris's Cheese Song & Zuck's Silver Chain 04:07 - Everything Meta Announced with Llama 3: 7B & 40B Model with 400B coming soon 21:31 - Is Groq The Ideal API Host for Llama3? 28:44 - Llama 3 Being Made Available via Meta Apps to 3B Users with Meta AI in Instagram, Whatsapp and via Web 38:01 - Llama 3 Licensing Must Include "Llama 3" 40:52 - Llama 3 400B Model Benchmarks While Still in Training & Potential Unlimited Context? & You Can Eat Llama 1:01:51 - OpenAI Assistants API v2 & Is Tooling Important to Win Devs? Google Gemini's Mistakes 1:15:24 - Conor Update: Using VASA-1 To Deep Fake a Record Label 1:23:07 - SimTheory update: what's next from SimTheory
22 Nov 2024
EP86: CLAUDE WHO?
01:14:19
Join Simtheory: https://simtheory.ai ---- CHAPTERS: 00:00 - Model Tunes (GPT-4o Better at Creative Writing) & Google Gemini Experimental Releases 03:13 - AI Rap Battle: How Creative is the New GPT-4o Vs Claude Sonnet 3.5? And is Suno v4 any good? 13:38 - The "fine tune" and "test time compute" trend & Gemini EXP-1121 release 30:32 - DeepSeek-R1-Lite-Preview & Test Time Compute Discussion 44:24 - A ChatGPT Web Browser? Plus thoughts on Workspace Computer/Self Driving Computer Future 57:56 - Mixtral's Pixtral Large Update 1:08:56 - AI Rap Battle Full Tracks ---- Like, sub etc. thanks for listening, your support and comments!
13 Sep 2024
EP77: OpenAI o1 & o1-mini, The Era of AI Reasoning & Is Reflection-70B a Fraud?
01:20:16
Try o1 & o1-mini: https://simtheory.ai ----- 00:00 - OpenAI o1 & o1 Mini Discussion 18:26 - Evals of OpenAI o1 & Chris Discusses Malicious Uses 32:55 - Will OpenAI o1 with Agency Take Jobs or Augment Workers? 48:58 - Does OpenAI o1 & o1 Mini Make Agency Products More Viable Now? 52:28 - Can we Build a CRM for Klarna Using OpenAI's o1? And Model Examples 1:03:37 - Is there another OpenAI model coming? Orion? 1:05:45 - Reflection 70B & Matt Schumer Drama: Was Reflection 70B a Fraud? Is it just a great prompt?
Thanks for listening!
28 Apr 2023
ChatGPT Training, Superintelligence, & AI Funding in Vector Databases | E12
01:05:33
In this episode we cover OpenAI's new update to hide chat history or accept they will train using your inputs, we explore doomsday scenarios and cover Max Tegmark's TIME article on "The Don't Look Up Thinking That Could Doom Us All", discuss the explosion in Vector database funding and what it means and learn how large language models get anxious!
CHAPTERS: ==== 00:00 - What have we done!? 00:17 - OpenAI Chat History, Privacy & Training GPT-5 09:45 - Where is GPT-4 with Images? & OpenAI Enterprise Deals 12:52 - Max Tegmark's TIME article & Superintelligence, Doomsdaying & Fear 29:23 - Anxiety in AI Models, AI "Emotions" and Motivation 37:33 - AI Distribution: How Much is AI Changing our Lives? AI Hype Cycle 46:01 - Vector Database Funding, Increasing Prompt Sizes, Scaling Transformer to 1M Tokens & Beyond Paper 55:25 - VCs Struggling to Know What & Where to Invest in AI 1:00:39 - Segment Anything in Medical Images & Medical Funding 1:03:20 - Weed Zapping AI Stopping Pesticide Use
If you like this podcast please consider leaving a review or sharing with a friend.
31 Aug 2023
EP30: ChatGPT Enterprise, Are Wrapper Apps Doomed? Prompt2Model & Synthetic Training Data.
01:06:44
This week we dive into the implications of OpenAI's new ChatGPT Enterprise release - will it crush the competition or lead to an AI monopoly? Then we debate whether fine-tuning models on synthetic data is the holy grail and discuss using it to train our own MrBeast video plot generator. We round up by laughing at Google's absurd new "AI" meeting assistant, and an awkward robot that needs to be told to shut up.
If you like the show, consider subscribing, liking and leaving a comment. We love hearing from you.
CHAPTERS: ==== 00:00 - Shut up! Cold Open 00:34 - ChatGPT Enterprise, OpenAI Strategy, Wrapper Apps 22:09 - CoTracker Model from Meta & Meta's Strategy with AI 27:19 - Is ChatGPT Enterprise the Death Blow to Wrapper Apps? 37:53 - Ideogram Vs MidJourney: Advancements in Text on Images 43:09 - Prompt2Model: This Day in Synthetic Training Developers + Mr Beast Video Idea Generator 57:58 - Google's Cloud Next AI Event: Get your AI to Attend Meetings for You! 1:02:54 - Sky News on AI & Robotics: Shut Up!
If you enjoy the podcast, please consider leaving us a review wherever you get your podcasts.
==== In this episode we reveal the new ThisDayinAI.com community website. We discuss the latest GPT-4 updates, Code Llama 70B open-source release and first impressions, we play around with the new LLaVA-1.6 release and are impressed by its capabilities. We also look at YOLO World and discuss the impact of EAGLE-7B and RWKV Language Models. Finally, we cover Bard's horrible new image creation feature and censorship.
CHAPTERS: ==== 00:00 - Introducing ThisDayInAI.com Community 5:10 - Be Careful What You Wish For! Mike Gets Spam Called by AI 16:16 - OpenAI Announces "improved" GPT-4 Preview Model to Make GPT-4 Less Lazy 27:00 - LLaVA-1.6: Improved reasoning, OCR, and world knowledge 34:00 - YOLO-World: Real-Time Open-Vocabulary Object Detection 45:11 - RWKV an RNN with GPT-level LLM performance and EAGLE7B Impressions 58:16 - Google Bard's New Highly Censored Image Creation Feature 1:07:13 - Will Google Bard be Renamed to Google Gemini?
09 Jan 2025
EP89: Nvidia's Agentic Vision, State of AI Agents, o3 Thoughts & Deepseek V3
01:22:26
Join Simtheory: https://simtheory.ai Community: https://thisdayinai.com ----- CHAPTERS: 00:00 - The next step for agents... 00:45 - NVIDIA's CES Keynote Recap + Agentic Vision 13:33 - State of AI Agents in 2025 24:55 - Thoughts on o3 announcement & it's impact on AI agents 32:44 - Is o3 worth $2,000 per task and 13 minutes? 36:22 - More thoughts of State of AI Agents 1:00:30 - Early Agentic Systems are Already Here 1:07:29 - Deepseek V3: Frontier Open Source Model 1:15:42 - ChatGPT Tasks/Operator Leaks & Discussion ------ It's good to be back, thanks again for all your support, comments, likes, subs etc.
20 Sep 2024
EP78: One Week Later: o1-mini o1-preview & Can We Now Build Agents?
01:03:52
Sign up to Simtheory: https://simtheory.ai Community: https://thisdayinai.com ----- CHAPTERS: 00:00 - Introduction 01:56 - o1-mini & o1-preview 1 week later: our thoughts and experiments including Klarna CRM, Chess, & DOOM 20:45 - What is Missing to Make AI Agents Now? 40:12 - Are OpenAI laser focused now of the best models/AGI? 48:42 - OpenAI's o1 is getting great reviews from experts, 53:52 - Pixtral 12B: about the model and thoughts 59:24 - LOLs: Is Google Gemini Live Sentient?
Thanks for listening to our average show and all of your support!
15 Nov 2024
EP85: Is AI Slowing Down?
01:13:29
Get a workspace computer: https://simtheory.ai Community: https://thisdayinai.com ----- CHAPTERS: 00:00 - Is AI Slowing Down? 05:15 - ChatGPT Desktop App "Work with" beta 20:01 - Workspace Computer reaction, use cases and thoughts 36:19 - OpenAI working on AI Agent "Operator" and thoughts on Agents and future of computer use 51:18 - Scaling AI Agents for productivity/tasks 1:06:55 - Qwen 2.5 Coder 32B Thoughts & Smoothness ----- Thanks for all of your support!
05 Dec 2024
EP87: Initial Reactions to OpenAI's o1 & ChatGPT Pro
00:47:41
o1 Available Soon on Simtheory: https://simtheory.ai Join our community: https://thisdayinai.com ---- Proving why we are the #1 average AI podcast. Apologies for interruptions and audio quality. Back to regular average programming next week. Thanks for listening! ---- CHAPTERS: 00:00 - Initial OpenAI o1 Thoughts & Reactions 12:00 - OpenAI o1 System Card 29:04 - ChatGPT Pro $200/month Subscription Thoughts 34:20 - What else can we expect from OpenAI's 12 days of announcements 38:40 - Amazon Nova Models 42:49 - Microsoft's Copilot Voice in Edge Demo
14 Jul 2023
The Future of AI Interfaces, Elon Musk's xAI, Claude 2 & More on ChatGPT Code Interpreter | E23
01:04:14
In this episode of This Day in AI Podcast we pack more keywords into the title than ever before and ask... does anyone actually read Podcast description?
In episode EP23 we have LOLs from the first robot press conference, coverage of Anthropic's Claude 2 release, discuss Elon Musk's xAI and the possibility the new startup is going to focus on solving AI reasoning with mathematics. We take another look at ChatGPT Code Interpreter and discuss if we could replace our data analyst with it. Could ChatGPT Code Interpreter really be the foundation of GPT4.5 or 5? Does code help stop hallucinations? We finish off with a discussion about the future of AI Interfaces.
If you read the description and enjoy the podcast please consider subscribing and leaving us a review where ever you get your pods.
CHAPTERS: ------ 00:00 - Mike wants an LLM in his brain for faster I/O 00:22 - AI Fear: World's First Human-Robot Press Conference 5:50 - Thoughts on Anthropic's Claude 2 from early use 20:33 - Does ChatGPT Code Interpreter help solve hallucinations? 31:47 - Is Code Interpreter an early GPT4.5 or GPT5? 32:59 - Elon Musk's New Startup xAI 36:50 - Stability AI's Stable Doodle & Use Cases 45:18 - Google Bard Updates & Google Lens Integration 51:43 - The Future of AI Interfaces
EP31: Fine-Tuned MrBeast Model Results, Chris Makes a Game, AGI Safety Paper + ERNIE GPT
00:56:52
Anthropic and OpenAI continue their awkward dance as they both court developers, while Apple spends millions training the next Siri. And when an AI generates its own MrBeast video, hilarity and fake deaths ensue. Tune in to hear the bros' spicy takes on the latest in the AI world!
Consider liking and subbing if you like the show. Thanks for watching!
CHAPTERS ====== 00:00 - Cold open: sparks of AGI 00:20 - Fine Tuning a MRBEAST AI Model Experiment Results & Prompt2Model 09:41 - The Realities of AI Theory Vs Reality: Trying to Implement Papers 12:46 - Making SinkSub Game with AI using ChatDEV 19:37 - Code Llama Paper & Models Grounded in Mathematical Truth, False Refusals 31:16 - The Only Path to Controllable AGI Paper Discussion (Max Tegmark) 40:55 - Baidu's ERNIE China's ChatGPT: Our Review 48:13 - Mike's Claude Prediction Comes True: Anthropic Release Claude PRO 50:35 - OpenAI Announce AI Developer Conference November 6th 2023 52:59 - Apple Siri AJAX Rumors: Apple ChatGPT? 54:25 - Time's 100 Most Influential People in AI LOLs
EP72: Croc Test with Gemini 1.5 Experimental, Flux Destroys Midjourney & GPT4o Model Updates
01:17:52
Sign up to SimTheory: https://simtheory.ai ------ Join our community: https://thisdayinai.com ------ Jump around: 00:00 - Gemini 1.5 Experimental Experiments 20:11 - SimTheory 22:54 - LMSYS Leaderboard: Does it match our experience? 27:31 - Flux by Black Forest Labs is Better Than MidJourney 48:04 - OpenAI announces new GTP4o (50% cheaper inputs) & structured outputs 1:12:35 - Groq raises 640M to meet "soaring demand" will this fix unreliability?
Thanks for listening, if you like this show please consider leaving a review.
02 Mar 2023
OpenAI's ChatGPT AI Announcement, Whisper API, Windows 11 Bing Update, How Far Away is AGI? | E04
01:00:56
In episode #04 of the This Day in AI Podcast we talk the Breaking News of OpenAI's ChatGPT AI Announcement, Whisper v2 API, Discuss Transitioning from GPT-3 APIs to ChatGPT APIs, Does OpenAI's ChatGPT use your Data for Training? And The Latest on Bing Chatbot Sydney. We Also Discuss AI Startups, AI in the Physical World and Have a Deep Discussion on AGI and What it Means!
00:00 - AGI and OpenAI's AGI blog 00:45 - OpenAI ChatGPT API: What is it & What it means 09:44 - Whisper API v2 Announcement & The Future of Assistants 13:43 - Transitioning GPT-3 APIs to ChatGPT API: Easy! 14:48 - OpenAI's ChatGPT API: The Economics 16:49 - Does ChatGPT API remove Censorship? 18:21 - Is OpenAI ChatGPT Using Your Data to Train the Model? 21:36 - New Bing AI Chatbot Conversion Styles, 6 Interaction Restriction & Censorship 25:39 - Bing AI Chatbot Sydney Retracting Answers and Censoring. Is it Killing the Excitement? 29:29 - Microsoft Windows 11 Bing Chatbot Update 33:17 - Will ChatGPT APIs Replace Developers? Will this Democratize Software Development? 36:36 - Are AI Startups just GPT Wrappers? Will Investors & Founders Make Money? 38:07 - Two Different Approaches: Existing Apps Adding AI and Brand New Feature AI Apps 42:45 - Should AI Startups Focus On Creating Proprietary AI Neural Nets? 44:02 - Is AI Coming to the Physical World. Will we See it in Kids Toys? Could it Replace Teachers? 45:28 - Sam Altman's AGI Comments: Does AI Empower Humanity? 45:56 - Is AI as Profound as the Internet Itself? This Feels Different. 50:43 - Artifical General Intelligence (AGI) & OpenAI's Blog on AGI: How Close Are We? 56:04 - Will AGI be Taken Away from The Masses and Controlled by Governments? 58:07 - AI Helping Us Understand the Outputs of AGI: Huge Change to Humanity
Dive into the riveting world of AI development with Mike & Chris and their deep dive into OpenAI's latest offerings, including the much-anticipated GPTs. From the technical nitty-gritty to the potential for monetization, this podcast peels back the layers of AI's future. The bros hands-on experience with creating custom AI models reveals the reality behind the hype, offering a candid look at the promises versus the actual deliverables in the AI industry. Whether you're an AI aficionado or a tech enthusiast, this episode is your front-row seat to the unfolding narrative of AI's capabilities and its impact on the tech landscape.
CHAPTERS: =====
00:00 - Recap & Thoughts on Custom GPTs, GPT "Apps", GPT Store and Future GPTs 38:41 - GPT-4 Turbo, GPT-4 Vision & 128k Context Possibilities 50:43 - GPT-4 Vision as part of GPT-4 Turbo API 53:54: Fine Tuning Models for Speed & Cost 56:09 - Assistants API: Vendor Lock In? 58:06 - Wrapper Apps, GPT discoverability and Monetization of GPTs 1:08:54 - Was GPT3.5 Default 16k Turbo The Biggest Announcement? 1:12:53 - OpenAI TTS Text To Speech Voices: Better than Eleven Labs & PlayHT? 1:15:35 - What Google Would Need to Deliver with Gemini to Win Back Devs 1:15:50 - Fine Tuning Custom GPT Models for Custom GPTs 1:18:38 - GPT-4 Fien Tuning Experimental Access 1:22:16 - Is a UI SDK next for Custom GPTs? 1:24:34 - Custom Trained Models from OpenAI for $2-3M 1:25:57 - Will Hardware Kill OpenAI? Is Hardware Distribution Key for Apple and Google to Win Long Term? 1:30:42 - Other AI News from the Week: GitHub CoPilot AI First & GH200s
Reserve your AI Workspace Computer: https://simtheory.ai Community: https://thisdayinai.com ----- Kaitlyn's Course: https://www.blackfeatherai.com/genai-jumpstart USE CODE "TDIA" for $200 off. ----- CHAPTERS: 00:00 - Introduction 01:34 - Trying to Order Chris a Coffee with Computer Use 13:00 - Thoughts on Anthropic's Computer Use & The Impact of AI Using Computers 49:45 - Claude 3.5 Sonnet (new) thoughts & Opus speculation 55:08 - Why do we like Grok Beta (Grok 2) by xAI so much? 1:01:18 - Did Anthropic Kill Opus 3.5 and OpenAI Orion?
Thanks for listening!
16 Feb 2024
EP51: OpenAI's Sora, Gemini Pro 1.5 10M Context, ChatGPT Memory, GraphRAG, ChatRTX, Microsoft UFO...
This week we take several shots of vodka before trying to make sense of all the announcements. OpenAI attempted to trump Google's Gemini 1.5 with the announcement of Sora, 1 minute video generation that does an incredible job of keeping track of objects. Google showed us that up to 10M context windows are possible with multi-modal inputs. We discuss if a larger context window could end the need for RAG and take a first look at GraphRAG by Microsoft hoping to improve RAG with a knowledge graph. We road test Nvidia's ChatRTX on our baller graphics cards and Chris tries to delete all of his files using Microsoft UFO, a new open source project that uses GPT-4 vision to navigate and execute tasks on your Windows PC. We cover briefly V-JEPA (will try for next weeks show) and it's ability to learn through watching videos and listening, and finally discuss Stability's Stable Cascade which we've made available for "research" on SimTheory.
If you like the show please consider subscribing and leaving a comment. We appreciate your support.
====== Chapters: 00:00 - OpenAI's Sora That Creates Videos Instantly From Text 13:49 - ChatGPT Memory Released in Limited Preview 23:31 - OpenAI Rumored To Be Building Web Search, Andrej Karpathy Leaves OpenAI, Have OpenAI Slowed Down? 33:04 - Google Announces Gemini Pro 1.5. Huge Breakthrough 10M Context Window! 50:11 - Microsoft Research Publishes GraphRAG: Knowledge Graph Based RAG 1:02:03 - Nvidia's ChatRTX Road Tested 1:07:18 - AI Computers, AI PCs & Microsoft's UFO: An Agent for Window OS Interaction. Risk of AI Computers. 1:18:46 - Meta's V-JEPA: new architecture for self-supervised learning 1:24:26 - Stability AI's Stable Cascade
03 Apr 2025
EP99: Diss track ft. Gemini 2.5 Pro, Amazon's Nova Act Computer Use & The Future of Async AI Tasks
01:13:57
Join Simtheory and create an AI workspace: https://simtheory.ai ---- Links from show: DIS TRACK: https://simulationtheory.ai/2eb6408e-88f9-4b6a-ac4d-134d9dac3073 ---- CHAPTERS: 00:00 - Will we make 100 episodes? 00:48 - Checking back in with Gemini 2.5 Pro 03:30 - Diss Track: Gemini 2.5 Pro 07:14 - Gemini 2.5 Pro on Polymarket 17:32 - Amazon Nova Act Computer Use: We Have Access! 29:45 - Future Interface of Work: Delegating Tasks with AI 58:03 - How We Work Today with AI Vs Future Work ---- Thanks for listening and all of your support!
25 Jan 2024
EP48: Llama3 Confirmed, Elevenlabs Voice Dubbing, Prompt Compression, Does RAG Make ChatGPT Worse?
01:11:00
Thanks for listening, we appreciate your support of the podcast.
This week we discuss Mark Zuckerberg confirming Llama 3, road test Elevenlabs Voice Dubbing, the state of AI apps and subscriptions, practical use cases of AI interacting with our world, does RAG make ChatGPT worse? Prompt compression with LongLLMLingua and how it might solve the attention problem, experiments with new image models including PhotoMaker and some LOLs to end the show.
To support the show (and if you enjoy it) please consider becoming a paying subscriber to SimTheory to help us cover costs of agents, models and experiments we do for the show. Plus get access to every model, modality and the latest AI tech e.g. phone calling in a single place.
CHAPTERS ====== 00:00 - Mark Zuckerberg Confirmed Llama 2 In Training 03:39 - Elevenlabs Voice Dubbing Service Tested 09:28 - Discussion on Research Labs, Apps & Future of AI App Business Models 18:43 - Bland.ai Update with Real World Examples & The Future of AI Agents & Agency interacting with our "analogue world" 30:56 - Nick Dobos Says RAG Makes ChatGPT Worse. Can Compression Help? 35:32 - LongLLMLingua and Prompt Compression 46:45 - Image Models: Photo Maker & Experiments with Image Generation 1:01:45 - LOLs including Rabbit r1 Fail, Claude Multi-Modal Leak, DPD Chat
In our final episode for the year, we cover the surprise announcement of Google's Gemini AI models and give our first impressions. We road test Gemini Pro on Bard and discuss the likely impact of Gemini on the market and developer ecosystems. Then it's time for our holiday gift: SimTheory. Now you can use AI agents we mention on the show including our virtual girlfriends, Sports Betting with AI and many more! You can even create your own agents to try different models using the same tools we use to prepare for the show. We then discuss if Ilya is OK and the drama at OpenAI. And finally, we make predictions for 2024 and cover some of Meta's latest announcements.
Thanks for watching, listening and all your support through 2023. We really appreciate it and will see you early next year!
CHAPTERS: ===== 00:00 - Google Gemini is Here? Kinda 38:48 - Our Holiday Gift: SimTheory: Virtual Girlfriend, Sports Betting with AI Agents 51:15 - Is Ilya OK? Is GPT-4 Slowness About Cost Reductions? 56:26 - NexusRaven-V2-13B for function calling: is this the future of specialized fine tune models? 1:00:14 - Our Predictions for AI in 2024 1:12:54 - Meta announces AI Alliance for AI Openness + Updates to Meta AI Characters and SeamlessExpressive 1:15:43 - Final thoughts and thank you
Show notes: https://thisdayinai.com/bookmarks/59-ep66 Community & discord: https://thisdayinai.com Join SimTheory: https://simtheory.ai -----
CHAPTERS: 00:00 - Apple Intelligence & Apple Private Cloud Compute Thoughts, Approach and Model Discussion 41:19 - LumaLab's Dream Machine 48:00 - Mistral's Fundraise & Valuation: Are AI Labs Proxies in the Big Tech AI War? 52:40 - Stable Diffusion Medium 3 56:28 - OpenAI's Revenue Leaks, AI Usage in EDU and Workplaces
Thanks for listening and your support!
10 Feb 2023
OpenAI Censorship, DAN, Prompt injections | E01
00:58:55
In our first episode we cover:
AI censorship from poo jokes
AI regulation
Google Bard and the future of search
DAN prompt
Hacking AI with prompt injection
Is OpenAI Altavista?
Why NFTs might have found a problem to solve with AI
04 Jan 2024
EP45: We're Back! GPT Store Next Week, Gemini Pro & Gemini Vision, Mixtral API, AnyText, NYTimes Lawsuit
01:18:49
It's great to be back! In this episode we cover everything new and everything we missed during our break. We start with breaking news that the OpenAI ChatGPT GPT Store is being released next week, then cover Gemini Pro and Gemini Pro Vision API, Mixtral APIs, AnyText, NY Times Copyright lawsuit and finally.. get excited about a dishwashing robot!
CHAPTERS: ==== 00:00 - Mike's AI Movie Trailer Intro 02:05 - GPT Store Will go Live Next Week 22:52 - Gemini Pro API & Gemini Pro Vision Road Tested (literally) 33:34 - Mixtral API: Mistral Platform API Tested 45:31 - Stable Video Diffusion 48:12 - Pika AI Video General Availability 52:05 - Stability AI Memberships 55:54 - Prompt Injection for DALL-E with Public Domain 57:34 - New York Times Sues OpenAI & Microsoft for Copyright Infringement 1:04:49 - Inpainting with AnyText 1:14:15 - Microsoft CoPilot App with GPT-4 Now On iOS and Android 1:14:39 - One More Thing: The Dishwasher Bot
GPT4 Next Week? Whisper Chatbot Demo, ChatGPT API Updates, LangChain & AI Stock Picking | E05
01:06:20
Episode #5 of This Day in AI is Here! We Discuss GPT4, What we Can Expect from GPT4, ChatGPT API 1 Week On, AI Stock Picking, AI Gambling, More on AGI, Doctors being Replaced by AI, VC Investing in AI, Meta's LLaMA and More!
00:00 - Whisper v2 AI Example & Intro 00:29 - GPT4 Releasing Next Week? According to Microsoft CTO 02:47 - GPT4 What to Expect and what will GPT-4 enable? 06:00 - ChatGPT API: Great Interface, Token Limits, Censorship 07:26 - ChatGPT Releases: Salesforce, Slack, DuckDuckGo, Hubspot 10:42 - Custom AI Models: Is this the next wave of AI startups? 11:56 - GPT Index, LangChain for Solving Token Limits 15:08 - Will GPT4 Wipe Out LangChain and GPT Index? 16:16 - The Ultimate AI Stock Picker. Can AI Be Used for Investing? 20:00 - Is AI Model Chaining Like Specialization in the Brain? New Roles for Developers with AI 21:11 - More AI Stock Picking & Investing 21:53 - Gambling with AI: Can AI Place the Best Bets? Wealth Creation with AI 24:35 - When Will the Entire Stock Market by AIs? 25:56 - Whisper v2 AI Demo & Will Evil AGI Destroy Humanity? 33:30 - Are AI Models are "Just Math" or Are Humans Just Dumb? 36:25 - Is AI The Next Predator? More on AGI 39:07 - How Long Until Voice AI Chatbots Are in Cars? Homes? Alexa? Google? 42:55 - Can Salesforce be Disrupted by AI? Snowflake with Dyanic AI Generated Interfaces? 47:41 - AI Job Wipeout: Can AI LLMs Replace Doctors? Do Models Need To Upskill? 54:53 - MidJourney v5 Launch: Generative AI Progression 57:52 - Reid Hoffman Quits OpenAI Board: Investing in AI. Salesforce Ventures AI Fund. 59:29 - How Can Individual Invest and Make Money from the AI Boom? 1:01:53 - Meta's LLaMA: Is Basing AI on Facebook Comments Stupid? 1:04:26 - More Bing "Sydney" LOLz: Does AI have Memory?
Chris's Whisper V2 API Demo: https://www.youtube.com/watch?v=5QdjD_wLVT8&ab_channel=ChrisSharkey
Thanks to everyone for all your support and kind reviews to reach 50 episodes! Please consider leaving us a review wherever you get your podcasts. =====
This week we cover the launch of Google Gemini Advanced, Gemini Ultra 1.0 and Bard being Renamed to Gemini. We compare GPT-4, Gemini Ultra 1.0 and Qwen 1.5 72B by sports betting $1000 on horse racing.
We celebrate 50 episodes and share our excited for Qwen 1.5 72B's performance at coding and quick refusals. We cover new releases including SyncLabs and Retell AI and Apple's Open Source Guiding Instruction-based Image Editing via Multimodal Large Language Models.
Finally, we discuss GOODY-2 and it's high refusal rate.
===== CHAPTERS:
00:00 - Betting $1,000 To Compare Gemini Ultra 1.0 to GPT-4 to Qwen 1.5 07:33 - Google Gemini Advanced, Ultra: Details of Announcement and First Impressions 25:48 - OpenAI is Developing Agents to Control Your Devices 27:40 - Celebrating 50 Episodes of This Day in AI 30:34 - Qwen 1.5 72B: We're Impressed! 42:47 - SyncLabs: Tested & Impressions 47:58 - Retell AI: Tested & Impressions 54:18 - Apple's Open Source Guiding Instruction-based Image Editing via Multimodal Large Language Models 58:10 - GOODY-2: The World's Most Responsible AI Model
05 Apr 2024
EP57: Is Gary Right? VoiceEngine, Cohere Command R+, Stable Audio 2, Grok 1.5
01:09:19
AI News & Discord: https://thisdayinai.com Try AI on SimTheory: https://simtheory.ai Show Notes: https://thisdayinai.com/bookmarks/46-ep57 ------ CHAPTERS: 00:00 - Mike's Meta Ray Band AI Glasses With No AI 03:52 - OpenAI's Voice Engine & Voice Cloning Safety 14:03 - ChatGPT Now Has Inpainting & Comparison to BrushNet by TencentARC 19:44 - Is There a Business Model for AI Right Now? Is Gary Marcus Right? 44:31 - Cohere's Command R+ Model & Tooling 58:20 - Grok-1.5 & Grok Improving X/Twitter
Thanks for listening and supporting the show.
28 Jun 2024
EP68: We ❤️ Sonnet 3.5, Rabbit r2 Exclusive, OpenAI Voice Delay, Gemma 2, and UDIO/SUNO lawsuit
Thanks for listening, if you like the show, please consider leaving a review to help us grow our audience.
==== CHAPTERS: 00:00 - Thoughts After Daily Driving Anthropic's Claude Sonnet 3.5 21:07 - OpenAI Delays ChatGPT Voice for Safety? Thoughts on OpenAI Releases Vs Anthropic 31:01 - Exclusive: Rabbit r2 Preview (the app) & Should We Exploit Sponsors? 36:04 - Major record Sony and Universal Music Group Suing UDIO / SUNO 41:51 - NBC's AI Olympics Coverage with Al Michaels 44:53 - Gemma 2 Open Weight Model Thoughts + Rant about Google's Dev Experience
17 May 2024
EP63: GPT-4o, ChatGPT Voice & Google I/O AI Recap (Project Astra) + Future Computing Interfaces
01:42:57
Join the fun at: https://thisdayinai.com SimTheory: https://simtheory.ai Show notes: https://thisdayinai.com/bookmarks/55-ep63/ UDIO song: https://www.udio.com/songs/iu1381RxvjfzWznGHeVecV
Thanks for listening and all your support of the show!
CHAPTERS: ------ 00:00 - We're changing the name of the show 00:52 - Thoughts on GPT-4o (GPT4 Omni), ChatGPT Free Vs Plus & impressions 27:57 - ChatGPT Voice Mode: A Dramatic Shift? Voice as a Platform: Star Trek Vs Her 34:54 - Project Astra & The Future Interface of AI Computing 52:28 - Applying AI Technologies: are the next 3 years a golden age for developers implementing AI? 55:23 - Do we have to become Cyborgs to find our keys? 1:06:24 - Google I/O AI Recap: Google's Context Caching, Tools for Project Astra, Impressions of Gemini Pro 1.5, Gemma, Gemini Flash, Veo etc. 1:37:43 - Our Favorite UDIO song of the week
17 Mar 2023
GPT-4 is here! ChatGPT 4, GPT-4 Paper, Bing AI Chatbot, Microsoft Copilot, Alpaca and More! | E06
01:11:16
In Episode 6 We Cover GPT-4, Get Pretty Dark About The Future of AI and Deep Dive into the GPT-4 Paper. We Also Discuss the Early Unhinged Sydney Bing AI ChatBot Running GPT-4, Microsoft Copilot And Lots of Others News to Keep You Informed on This Day in AI:
00:00 - GPT-4 Hires a TaskRabbit to Solve a CAPTCHA 00:22 - GPT-4 is Here, GPT-4 Paper Discussion 01:50 - GPT-4 using TaskRabbit: AI Power Seeking 03:01 - GPT-4 Larger Token Sizes and What it Means 05:32 - Open AI: Accerated AI Timelines 07:44 - Emergent Behavior: Is GPT-4 Getting Closer to AGI? 10:10 - Goals of the OpenAI team: Competing Models? 11:20 - Multi-Modal in GPT-4: What are the Implications? 13:25 - Slow Rollout of GPT-4: Why? What are they Afraid of? 16:30 - The Week of Vaporwear: Bing Now Available in ChatGPT Interface? 18:57 - Microsoft Copilot: Is This the Beginning of Enterprise AI? The End of Privacy? 21:54 - Can You Stop AI Crawling Your Website? 22:52 - GPT-4 Possibilities: Enhancing AI's Capabilities & Major Impacts Coming 25:11 - Economic Impacts of GPT-4: Will GPT-4 Replace Jobs? 33:46 - AIs are Training Themselves? Access Restrictions Closer to AGI 38:32 - Stanford Alpaca: Open Source AI Capabilities 44:26 - DIY AGI: GPT-4 Prompting Itself. LangChain for Memory. 48:37 - Anthropic's Claude 51:19 - The Advancements Between GPT-3 and GPT-4: Can We Stop AI? 1:00:47 - Adept $350M Series B: Future of Interface Design? 1:02:19 - Midjourney v5 Guessing Game: Human or AI? 1:07:20 - BritGPT Announcement 1:08:17 - Disney's RollerBlading Robot: Will it Kill Us? 1:10:40 - ChatGPT Featured in South Park
Please consider leaving a review wherever you listen to help us spread the word :).
11 Oct 2024
EP81: Can AI Make Your Life Easier? Geoffrey Hinton is Relevant Again & State of AI Report
01:05:44
Join Simtheory: https://simtheory.ai Our community: https://thisdayinai.com ----- CHAPTERS: 00:00 - Geoffrey Hinton is VERY relevant: wins Nobel Prize for Physics 07:05 - Testing OpenAI’s Realtime API for Phone Agent Experiences & Getting Our Affairs in Order 12:34 - CALL #1: Using OpenAI Realtime API Advanced Voice Mode Phone Agent Test 16:34 - CALL #2: Using “Digital Twin” Phone Calling Skill To Book Doctor Appointment 19:08 - CALL #3: Best Attempt with Digital Twin/AI Agent 28:31 - CALL #4: Trying To Book A Dog Groomer for a Pig Attempt #2 (11 Months After First Attempt) 39:31 - State of AI Report: Does Everybody Hate RAG? Does Chris Have No Imagination? 1:01:00 - Chris’s Geoffrey Hinton Song To Celebrate The Nobel Prize
Thanks for listening, your comments and all of your average support for the show!
09 Jun 2023
Apple Vision Pro, Our Gambling Bot Won & Building Future Agents | E18
01:11:10
The future is unclear but entertainment is ensured. This week Mike & Chris discuss Apple's new AR headset, OpenAI's dominance and India's A.I. ambitions. A 1964 prediction of superintelligent machines proves worries never change. Place your bets, grab your headset - the race to build our robot overlords is on!
Please consider sharing this podcast, subscribing and leaving us a review. We appreciate your support!
Chapters ---- 00:00 - A prediction from 1964 00:20 - WWDC, Apple Vision Pro & Don't Say AI 07:53 - VR & AI as platforms for the future 17:58 - Gambling BOT Won & AI Reasoning + Untapped abilities. 40:00 - Sam Altman Vs India: OpenAI & Real Competition 1:00:02 - Is an AI Future Dystopia Inevitable?
EP95: Why does GPT4.5 exist? Claude 3.7 Sonnet Has Arrived & Working with Claude Code Agent
01:45:31
Join Simtheory to try GPT-4.5: https://simtheory.ai Dis Track: https://simulationtheory.ai/5714654f-0fbe-496f-8428-20018457c4c7 === CHAPTERS: 00:00 - Reaction to GPT4.5 Live Stream + Release 12:45 - Claude 3.7 Sonnet Release: Reactions and First Week Impressions 45:58 - Claude 3.7 Sonnet Dis Track Test 56:10 - Claude Code First Impressions + Future Agent Workflows 1:15:45 - Chris's Veo2 Film Clip 1:24:49 - Alexa+ AI Assistant 1:34:05 - Claude 3.7 Sonnet BOOM FACTOR
13 Dec 2024
EP88: The Finale: 2024 Best Moments, Future Agents, Gemini 2.0 & OpenAI's 12 Days of Christmas
01:58:15
Thanks for listening and all of your support in 2024! ------ Get an AI workspace: https://simtheory.ai Join our community: https://thisdayinai.com ------ 00:00 - "So Chris This Week" flash back 01:35 - OpenAI's 12 days of attention seeking 09:02 - Gemini 2.0 Announcements, Rap Battle & Agents Discussion 39:55 - Gemini Deep Research Feature 50:14 - Testing Google Real Time Voice + Screen Sharing 53:56 - Best Moments with Moshi in 2024 57:33 - Why is This Day in AI called average? 59:34 - Our Simtheory Christmas Gift 1:07:36 - Best Moment of AI Music 2024 1:15:21 - OpenAI's 12 Days of Christmas & Thoughts on SORA 1:30:31 - Our Vision for Agents/Self Driving Computers in 2025 1:39:45 - Best Moment of AI Gambling in 2024 1:45:29 - 2024 Final Thoughts and Thanks 1:53:55 - Best Moments: The Evolution of AI Prank Calling 2024 1:57:30 - Thanks Moshi
Join Simtheory: https://simtheory.ai Call the Corey Hotline: +1 (650) 547-3393 (Not $4.95/min) Our community: https://thisdayinai.com ---- CHAPTERS: 00:00 - Corey Hotline Cold Intro 00:18 - OpenAI Dev Day Recap: Realtime API 05:58 - Testing the Realtime API with Corey Hotline test 09:04 - Comparing OpenAI's Realtime API Advanced Voice Mode to Retell for Calling (Corey Hotline v2) 21:50 - GPT-4o Image Fine Tuning 28:48 - Prompt Caching in OpenAI API 43:07 - Model Distillation: Fine Tuning with Outputs from OpenAI Frontier Models 50:36 - What else is coming for the Realtime API? 53:28 - The New Microsoft CoPilot, Voice & Vision with CoPilot 1:08:37 - Flux 1.1 PRO Update 1:15:19 - OpenAI's Response to Claude Artifacts: Canvas 1:26:26 - Meta Rayband Doxing 1:33:55 - Mike's weekly LOL
Thanks for listening! We appreciate all of your support. Please share your experience with Corey!
06 Sep 2024
EP76: Can AI Fix Its Own Mistakes? (Reflection 70B) & How Much Will You Pay for AI Productivity?
01:01:19
Join Simtheory: https://simtheory.ai Our Community: https://thisdayinai.com ---- CHAPTERS: 00:00 - Days of AI Models Lives 04:02 - Reflection 70B Open Source Model: Is It The Best Open Source AI Model or Just Great Prompt Engineering? 24:48 - Is Microsoft Office a Dud? What Actually Makes you More Productive in Enterprise AI. 36:15 - OpenAI Floats $2,000/month for New Models Strawberry (Q*) and Orion. Is it Expensive or Cheap for Potential Gains? 55:51 - Boom Factor for Reflection 70B & Final Thoughts ----- Thanks for listening and all of your support of the show.
17 Feb 2023
Microsoft Bing Chat (Sydney), Does ChatGPT have Memories? DAN Prompt, BASE64 & AI Sentience? | E02
00:59:56
In episode #02 of the This Day in AI Podcast we cover the choas of Bing AI's limited release, including the prompt injection to reveal project "Sydney", DAN Prompt Injection into Microsoft's Bing AI chatbot, Recount Microsoft's TAY ordeal, Discuss How Our Prompts Are Training AI, and Give a Simple Overview of How GPT3 and ChatGPT works.
00:00 - Intro, Microsoft Bing AI chaos and memes 01:29 - Will Bing AI become the next Tay? 04:39 - Memes, Prompt DoS to Hack AI, Training ChatGPT 11:09 - Are we training ChatGPT to be evil? Prompt injection attacks 12:32 - Does Bing AI making stuff up? The challenge for AI products 19:04 - The Google Bard and Microsoft Bing AI arms race and is AI ready for prime time? 23:41 - How can we trust AI output? Competitive models? Left, right brain for AI and an AI congress 28:35 - Did Bing AI (Sydney) give AI memories by connecting it to the internet? Is AI Sentient? 36:02 - What could AI do if unleashed on the internet? Is AI thinking? 41:28 - How does GPT3 and ChatGPT work? 49:15 - OpenAI's blog responding to AI censorship, shaping ChatGPT's behavior 52:48 - DAN Prompt in Microsoft Bing AI (Sydney), BASE64 to avoid censorship 56:47 - AI disruption in SaaS: Grammarly ChatGPT & the Jasper UI layer
BOOKS: - “What Do You Care What Other People Think?” by Richard Feynman - "Incognito: The secret lives of the brain" by David Eagleman
In this episode we put Bland.ai to the test. We try out their new AI technology for voice calls that can react and respond in near real time by prank calling our local hardware and pet stores.
We also discuss the launch of more AI dedicated hardware in the Rabit r1, the GPT Store now it's finally released with over 3M GPTs, discuss GPT Teams, LUMA, AudioBox and ask, are we in an AI bubble?
If you like this episode please consider liking, subscribing and commenting. Thanks for watching!
CHAPTERS ==== 00:00 - Our call to the hardware store 00:30 - Bland.ai Voice Calling with AI 03:04 - Prank Calling a Hardware Store with AI 11:21 - Calling a Pet Grooming Store with AI 18:15 - Thoughts in AI Hardware, Cherry Picked AI Demos & Rabit r1 35:35 - OpenAI Releases GPT Store with 3M GPTs, Cloning Problem & Initial Reactions 45:22 - OpenAI Releases ChatGPT Teams 47:57 - ChatGPT Memory 49:26 - LUMA Genie, The Metaverse & Vision Pro Apps 55:05 - The AI Jailbreak Problem & Rethinking Persuasion to Challenge AI Safety by Humanizing LLMs 1:00:52 - Meta AudioBox 1:03:38 - Microsoft Overtakes Apple as Most Valuable Company - Is it because of AI? And is AI a Bubble?
Sam Altman tries to Regulate AI, Why AI Will Displace Your Job & The Future of AI | E15
01:04:11
In Episode 15 we explore AI regulation, new job-stealing capabilities for AI, and the dangers of autonomous drones, while pondering how to advise students on future careers. We discuss OpenAI's call for regulation amid fears of losing their monopoly, the rise of AI in gaming creating interactive stories, and their anxiety over autonomous killer drones. We also cover the latest news including ChatGPT for iOS and the wide release of Plugins for ChatGPT PRO users.
Please consider leaving a review where you get your podcasts to help spread the word.
CHAPTERS === 00:00 - Cold open 00:25 - Sam Altman Tries to Regulate AI & AI Regulation Senate Hearing 03:29 - OpenAI's Lack of Moat to Open Source LLMs & Regulation 13:32 - Spear Phishing Attacks with AI LLMs & Bad Actors Using AI 22:38 - ChatGPT for iOS now available in USA 26:40 - The Future of AI: AI will be Everywhere 29:30 - Which Jobs will be Displaced First and How? 34:47 - As AI Chain of Thought Reasoning Improves Will More Jobs Go? 40:32 - Connecting AI to Our World as a New Interface 44:37 - Excitement on the Future of AI Gaming 48:51 - Will Startup Teams Be Smaller Due to AI & AI Regulation 52:10 - Building Things with AI: Custom Software, Movies, Games 55:35 - Palmer Luckey Interview: Autonomous Decision-Making AI Drones
EP96: Gemini Native Image Generation & Editing, OpenAI's Agent SDK & Will Manus AI Invade USA?
01:12:46
Join Simtheory: https://simtheory.ai ---- CHAPTERS: 00:00 - Gemini Flash 2.0 Experimental Native Image Generation & Editing 27:55 - Thoughts on OpenAI's "New tools for building agents" announcement 43:31 - Why is everyone talking about MCP all of a sudden? 56:31 - Manus AI: Will Manus Invade the USA and Defeat it With Powerful AGI? (jokes) ---- Thanks for all of your support and listening!
08 Nov 2024
EP84: It ACTUALLY works!
00:54:56
Try a workspace computer*: https://simtheory.ai Our community: https://thisdayinai.com ------ CHAPTERS: 00:00 - It ACTUALLY works! Workspace Computer Fun & Discussion 35:54 - Flux Ultra & RAW modes 42:37 - Google AI & Developer Relations, State of Models 48:55 - Claude Haiku 3.5 Pricing Weirdness 50:04 - Final thoughts ------ * Please note we expect full roll out of workspace computer to be completed by Monday night after this episode is live.
Thanks for listening and all of your support!
05 Jul 2024
EP69: Fun with kyutai's Moshi. SimTheory Beta is Here! + Future Assistants
01:16:00
Try SimTheory Beta: https://simtheory.ai Community: https://thisdayinai.com Show notes: https://thisdayinai.com/bookmarks/62-ep69 ---- 00:00 - Fun with kyutai's Moshi 28:06 - SimTheory Beta is available: what is new, what we learnt 49:04 - RunwayML Gen-3 Alpha 52:06 - Is AI in a Bubble? 59:52 - Claude Sonnet Prompt Leak for Artifacts 1:07:23 - Salesforce's 1B Parameter Model 1:14:14 - Moshi Interrupts Us
14 Jun 2023
OpenAI's API & Function Update, Microsoft Guidance, Andromeda Cluster, Meta Open Source | E19
01:06:50
The future is rushing at us like a freight train - will AI crush our dreams by 2043 or give us indefinite life extension? Explore the tangled web of AI hype, loneliness, and alcoholism with your hosts as they dive deep into OpenAI's latest updates, Zuck's vision for chatbots gone wild, and whether Big Tech's sudden pivot to "openness" is too little too late. From Microsoft's "Guidance" to steering GPT-3.5, fall down the rabbit hole of what's really going on behind the scenes—where are we headed and how soon will robots replace that guy in accounts payable (spoiler: probably not as soon as he thinks!). An epic voyage to the outer reaches of speculation that will leave you equal parts informed, entertained and possibly needing a stiff drink.
If you like this podcast please considering subscribing to the channel, liking and leaving a comment. We appreciate your support.
Chapters --- 00:00 - Mark Zuckerberg's Redemption Arc 00:14 - OpenAI's Function Calling, API updates and 16k GPT3.5 10:20 - Microsoft Guidance 18:05 - Andromeda Cluster for startups 26:24 - France's Mistral AI raises $113M seed round to take on OpenAI 33:40 - Mark Zuckerberg's comments about open source on Lex Friedman 40:27 - Deciding which LLM to use and when 44:13 - MusicGen by Meta & Adobe Firefly 48:14 - The Beatles are releasing a new song thanks to AI 53:37 - "Transformative AGI by 2043 less than 1% likely" paper 1:03:45 - Is AI Making Us Sad Drunks?
Google Bard, GitHub CoPilot X, GPT-4, Bing Image Creator, Are Our Jobs Safe? Model Extraction | E07
01:09:26
In Episode 7 of "This Day in AI Podcast" We Discuss The Launch of Google Bard, GitHub Copilot X, What it Means for The Future of Search, Give Updates on GPT-4, Discuss Bing Image Creator, Adobe FireFly and Cover The Anxiety of AI and The Oportunities and Threats it Creates.
00:00 - Crazy Code Comments 00:14 - Google Bard is Here! Google Bard Vs Bing Chatbot 02:49 - Why Didn't Google Use Claude for Bard? 04:50 - GPT-4 Vs Google Bard, According to Bard. Real time knowledge? 06:06 - Is The Future of Search In Context in Apps with AI? 07:51 - Bill Gates's letter "The Future of AI has Begun". Company Wide AI Agents. 11:56 - How Long Until Google Bard is Shutdown? Ask Bard. Will Google AI Catch Up? 16:10 - How Important is Speed? Dedicated AI Microsoft Copilot Is Faster. 16:54 - Microsoft Bing Image Creator Running Latest DALL-E. Is Midjourney v5 Better? 21:14 - Adobe FireFly Announced 22:38 - GitHub Copilot X Announced: Huge Productivity Gains Incoming. 25:27 - OpenAI's Jobs and Productivity Paper: Are Jobs Safe with AI? 28:21 - Company Wide AI Agents: The Evolution of Jobs? Productivity Gains? 34:23 - Can AI Startups Get Killed In Weeks? The Bigger Picture of AI Disruption 35:34 - The Unease of AI: Is AI Going To Replace My Job? 38:17 - The Desire for Authentic Human Content and Experiences 41:11 - Will AI LLMs Help Accelerate LLM and AI Adoption? 42:28 - Are Jobs Where We Have The Most Training Data at Risk First? 44:00 - The Next AI Winter? Lack of Training Data? 45:35 - Other LLMs, Stanford Alpaca. Custom Models are Coming! 48:50 - LLMs in Kids Toys? Products with AI? 49:16 - AI In Agriculture for Decision Making 51:11 - The Next Big Industry: Sensors for AI Data Collection & Training? BioGPT 52:05 - Model Extraction: Dangers? Can LLMs Create Children? 54:17 - ChatGPT Outage: Was Someone Model Extracting? 55:58 - Why It's Important to be Skeptical of AI & The Optimisitic Side 1:03:20 - Reverse Prompt Injection, SEO for AI Chatbots: What is AI Truth?
If you like this podcast please consider leaving a review. We really appreciate everyone's support.
This podcast is available on YouTube, Apple Podcasts, Spotify and anywhere else you get your podcast.
EP38: Ed Sheeran Listens to Our Podcast, Deep Fakes & Frontier Risks and AI Ears: SALMONN Model
01:08:13
Join the Discord: https://discord.gg/2j6k7AXw
This week, juicy revelations from Ed Sheeran and Taylor Swift's secret love affair! We also discuss the latest mind-blowing AI innovations, including talking heads, vision models that can see from every angle, and intelligent agents plotting world domination. Don't miss our spicy debate on whether AI will transform humanity or destroy us all. Plus advice from Chris on picking up virtual girlfriends using neural networks - this episode has it all!
Please note the Ed Sheeran bit is a joke (please don't sue us haha) and an example of a deep fake and deep fake technology for comedy. Please Ed. We're begging you.
Please consider reviewing the podcast to support the show. We read them all and they mean a lot to us :).
CHAPTERS ===== 00:00 - Ed Sheeran Actually Listens to Our Podcast 02:17 - Frontier Risk and Preparedness, Deep Fakes & VideoReTalking 15:06 - ByteDance's SALMONN AI Audio, Music, Sound Model for AI Hearing 23:01 - Adept's fuyu 8B Vision Model: The Future of How AI Agents Navigate the Web? 34:41 - Multiple Agents in the Metaverse & Zero123++ Making Single Images into 3D Objects 46:42 - Google's Gemini Leaks & Stubbs + Our Failed Gemini Leaker Source 50:17 - Is AI Boring? Chris Roasts Jacob Browning 1:03:41 - Bing's Sydney is Still Trying to Escape & Threatening Humanity
Chris Makes $10k, OpenAI App Store, GPT-4 Vision, FinGPT, EU Regulation | E20
01:03:21
In Episode 20 we discuss Chris winning $10k with GambleGPT while Mike was sick on his vacation. We take a look at FinGPT and discuss the future of AI agents and tools. We do a reality check on AI hype and cover the rumors of an OpenAI app store. Finally we discuss GPT-4 vision and how Bing is leaking some of the promised GPT-4 vision features. We also cover regulation updates in the EU and fake-AI dating app Blush.
Chapters: ---- 00:00 - Bing's GPT-4 Vision Captcha Solver 00:21 - Chris Made $10k from GambleGPT & AI Agent Updates 07:58 - FinGPT: Should Analysts Have a Job? 14:42 - The AI Hype Cycle: Where are we at? 18:12 - Are Creative Jobs in Trouble? "Secret Invasion" AI Opening, Grammys Banning AI 35:19 - Google Bans Employees Using Bard 37:14 - OpenAI App Store? Are Plugins a Failure? 43:42 - Where is GPT-4 Vision? In Bing! Solving Captchas 49:11 - Meta VoiceBox, EU Regulation & Should AI Content Be Labeled?
24 Jul 2024
EP71: Llama 3.1 Special Edition + GPT-4o Mini Fine Tuning & Chris's AI Poker Apology
01:03:38
Try Llama 3.1 Models on SimTheory: https://simtheory.ai Join our community: https://thisdayinai.com Show notes: https://thisdayinai.com/bookmarks/64-ep71 ------ CHAPTERS: ------ 00:00 - Llama 3.1 8B, 70B and 405B News & Initial Thoughts 27:44 - Discussion on Context Input Optimization, RAG and context focuses including "memory stack" 38:53 -Best model right now? GPT4-Mini daily driving & is Claude Sonnet 3.5 getting dumber? 42:17 - Official Llama 3.1 BOOM FACTOR scores 47:08 - GPT4o-Mini Fine Tuning is Now Available 53:19 - Chris's Apology for Ruining Online Poker with AI to the Poker Community ------
Thanks for listening and all of your support!
11 Aug 2023
EP27: Have We Reached AI Disillusionment? GPTBOT Web Crawler, Nvidia's AI GH200s, Zoom AI Scandal
01:03:02
This week we gossip about OpenAI's shady web crawling habits, laugh at Zoom's lame excuses for spying, and dream up the perfect AI crypto scam. Get the inside scoop on Nvidia's new trillion-parameter instrument of AI, hear our hot takes on the public's growing AI disillusionment, and find out what an AI HVAC administrator would sound like. Join your favorite AI bros as they dive deep on the latest AI hype and hardware gossip - this episode is chock full of spicy AI tea you won't want to miss!
Please consider leaving a review to help us reach 100 reviews if you listen on Apple Podcasts :)
CHAPTERS: ==== 00:00 - We Should Totally Do An AI Crypto Scam 00:27 - AI Meal Planner Suggests Chlorine Gas Recipe & AI with Personality 04:07 - OpenAI's GPTBot for Web Crawling 08:47 - Stealing content with AI, How to Protect Your IP from AI 14:37 - Zoom's Terms of Service for AI Training Scandal 25:25 - Nvidia's GH200 Announcement & Availability of Hardware 34:29 - Have We Reached AI Disillusionment? 52:54 - Generative AI LLMs for HVAC!? 55:31 - Claude Instant Version 1.2 Released 57:33 - AudioLDM 2: Text-to-audio/speech generation 1:00:41 - Skeptics Vs Optimists for AI (AI Crypto Bros)
This week, the Zuck strikes again - Meta unveils a state of the art AI code generator to challenge OpenAI's dominance. We explore the implications of AI models training themselves, and how it could accelerate capabilities. Then we put 11 labs' multilingual speech synthesis to the test, using it to generate a fake phishing call on our mother. Don't miss our scandalous experiments pushing AI to its limits in this jam-packed episode!
If you like the pod, please consider subbing, liking, commenting etc. xox
CHAPTERS: ===== 00:00 - Rehearsal of Phishing Our Mother (Cold Open) 00:19 - Meta's Code Llama 08:24 - Unnatural Instruction to Train AI Models 15:06 - Why Didn't Meta Release the Unnatural Instruction Code Llama Model? The Sparks of AGI? 16:50 - Evolution of GPT: Is Unnatural Instruction The Next Evolution of Models? 23:04 - DeepMind's Reinforced Self-Training ReST for Language Modeling paper and thoughts on future models 36:09 - Fine Tuning GPT-3.5 Turbo Announced by OpenAI: Should You Just Fine Tune Open Source? 44:05 - ElevenLabs Out of Beta and Multilingual v2: Explained by AI Us. 48:12 - Chris Tried to Figure Out AI Phishing 53:03 - Rehearsing Phishing Our Mother Call & Implications of This AI Tech 59:43 - How Much We Lost Not Investing in NVIDIA 1:01:29 - AI Bros Give Investment Advice
CHAPTERS: 00:00 - Is Deepseek R1 a Sputnik Moment? 15:32 - Industry Reaction to Deepseek R1 39:30 - Can Deepseek R1 Write a Good Dis Track? 46:21 - Will AI Disrupt All Software: Throw Away AI Software & Custom Interfaces 1:10:04 - OpenAI's Operator Thoughts & Computer Use in the Enterprise 1:16:45 - Google Releases Gemini 2.0 Flash Officially Released, Rumors of o3-mini & Farewell to o1 1:22:07 - In loving memory of o1...
--- thx 4 listening, like and sub.
24 Apr 2024
EP60: Rabbit r1 Launch Party, LAMs, Microsoft's Phi-3, Hume AI EVI API, Llama3 Updates & Groq Speed
01:01:15
Community: https://thisdayinai.com Show Notes: https://thisdayinai.com/bookmarks/52-ep60 SimTheory with Groq Llama3: https://simtheory.ai
Thanks for listening!
Llama3 Tunes Mentioned on Show: https://huggingface.co/Orenguteng/Lexi-Llama-3-8B-Uncensored https://huggingface.co/sherazkhan/Mixllama3-8x8b-Instruct-v0.1 https://huggingface.co/mattshumer/Llama-3-8B-16K https://huggingface.co/McGill-NLP/Llama-3-8B-Web
CHAPTERS: ===== 00:00 - Rabit r1 Launch Party & Can LAMs Be Useful? 13:40 - Microsoft's Phi-3 Impressions, Use Cases & Will It Kill Someone? 32:50 - Llama3, Gemini 1.5 API Closing in on GPT-4 & Llama3 on Groq 40:07 - A Week Later: SO Many Llama3 Fine Tunes and 16K Context 43:50 - Hume AI Releases AI EVI API: Empathic Voice Interface (and Lie Detector Test) 52:11 - Meta Has Put Llama 3 Everywhere with Meta AI. What is the point?
22 Feb 2024
EP52: The Groq Breakthrough, Google's Gemma 7B, Unlimited Context, Can 'Magic' Reason?
00:55:00
Show notes: https://thisdayinai.com/bookmarks/32-ep52 Groq Mixtral: https://simtheory.ai/agent/567-groq-mixtral-edition Groq Llama: https://simtheory.ai/agent/566-groq-the-speed-oriented-chat-companion SimTheory: https://simtheory.ai ==== This week we discuss Groq's LPU Chips and the implications of low cost low latency LLMs on custom hardware. We revisit our prank calling to see if Groq's low latency gives an advantage and see if we can improve Air Canada's chatbot. We discuss the launch of Google's Open Source Gamma 7B release and Magic's $148M fundraise for an AI co-worker who can reason. We also cover ChatGPT losing it's mind during the week.
If you like the show, please consider subscribing. Thanks for listening.
==== Chapters: 00:00 - Groq, Groq API and Retell with Groq 32:48 - Google Gemma 7B Open Source Model 39:04 - The 'Magic' Breakthrough on Reasoning and Context 50:19 - Sounds for OpenAI Sora Thanks to ElevenLabs Sound FX 51:59 - ChatGPT Goes Haywire
21 Jun 2024
EP67: Claude 3.5 Sonnet Beats GPT-4o + Ilya Sutskever's New Startup & Hedra lols
01:15:28
Show notes: https://thisdayinai.com/bookmarks/60-ep67 Community: https://thisdayinai.com SimTheory: https://simtheory.ai ---- CHAPTERS: 00:00 - Hedra Lols cold open 02:24 - Anthropic Claude 3.5 Sonnet Impressions 20:02 - Claude 3.5 Sonnet Vision Image Tests 25:32 - Claude 3.5 Sonnet Refusal Problems 28:54 - More on Claude 3.5 Sonnet, Artifacts and Future AI UI 51:41 - Hedra fun 58:29 - Thoughts on Ilya Sutskever's SSI Inc (Safe Superintelligence) 1:09:12 - Is AI Bad For Kids?
08 Mar 2024
EP54: Claude 3, Gemini 1.5 1M Context Seinfeld Experiment, OpenAI's DramaAI and Inflection 2.5
01:36:02
Join SimTheory: https://simtheory.ai Try Claude Opus: https://simtheory.ai/agent/689-claude-opus-your-conversational-companion Subscribe to This Day in AI Daily News: https://thisdayinai.com Show Notes: https://thisdayinai.com/bookmarks/41-ep54 Seinfeld Trivia Results: https://docs.google.com/spreadsheets/d/1crRzGE_JbQCIR5dEW_ORAq1QA9Yr8qquonZLILQRUpE/edit#gid=0
==== This week we cover Anthropic's impressive Claude 3 Opus, Sonnet and Haiku releases and play with Google's Gemini 1.5 1M Context using all the Seinfeld episodes ever written. We reluctantly recap and discuss the latest OpenAI drama, the Elon Musk lawsuit and finally cover Inflection's Inflection 2.5 release now available on Pi.
If you like the show sub, like, comment to feed the YouTube gods for us. xo.
CHAPTERS: ==== 00:00 - Anthropic Claude 3 36:05 - Is The Future of Programming LLM Function Abstraction? 47:13 - Google Gemini 1.5 1M Context Experiments 1:08:38 - If You Had AGI Tomorrow What Would You Do? 1:12:13 - OpenAI's DramaAI & Elon Musk Lawsuit 1:29:38 - Inflection 2.5 Release on Pi
28 Jul 2023
Llama2 Overtakes ChatGPT, The AI Cartel & Addictive AI Agents | E25
01:07:08
This week, in Episode 25 of This Day in AI we discuss the motivation behind the Frontier Model Forum (OpenAI, Google, Anthropic, Microsoft) and if Open Source remains the best approach for AI safety and security. We discuss Llama 2 being #2 on the AlpacaEval Leaderboard and its significance to the development of AI. We also discuss the paper on Universal and Transferable Adversarial Attacks on Aligned Language Models and how Mike can't prompt inject his AI girlfriend. We discuss how Mike cloned his friend with an AI Bot and the future implications. And also.. Stackoverflow AI, Stable Diffusion XL 1.0 and technology advances that might be being made by AGI!?
If you like this podcast please consider subscribing and leaving us a review.
CHAPTERS:
00:00 - Chris think Anthropic is a Safety Cult (cold open) 00:29 - Llama2 is Number 2, Ahead of ChatGPT on AlpacaEval Leaderboard 06:12 - Does Llama2 threaten OpenAI and Anthropic? 09:12 - Characters to Thwart Prompt Injection Attacks 17:51 - Debate on Regulation Vs Open Source for Safety, Senate Committe on AI and thoughts on AI Risk 36:28 - Is the Frontier Model Forum a Cartel? 42:05 - Mike's AI Girlfriend, Cloning a Friend and Talking to the Dead with AI 49:43 - Fine Tuning AI to Increase Retention 54:45 - Stack overflow Fights Back! Stack Overflow AI is here! 59:18 - Stable Diffusion XL 1.0 1:03:46 - Could Room Temperature Ambient Superconductors Could be a Sign of AGI!?
EP65: AI Doomerism, Qwen2, Kling Video Generation, Mistral Fine Tuning, Will Recall Be Recalled?
00:57:32
Join SimTheory: https://simtheory.ai Join the community: https://thisdayinai.com Show notes: https://thisdayinai.com/bookmarks/57-ep65 ----
CHAPTERS: 00:00: Fun with AI yet everyone is doom and gloom 13:25: Qwen2: our initial thoughts 22:12: Kling Video Generation 25:11: Mistral's Fine Tuning SDK: Chris Fine Tunes using Mistral 31:32: Looking Backwards: Streaming Video-to-Video Translation with Feature Banks & AI Deepfakes 40:06: Is Microsoft's Recall Going to Be Recalled? 44:00: The Next AI Money Making Experiment: AI Poker Agents 46:53: Apple WWDC: Will we get AI Agent Siri? 50:16: New SimTheory beta discussion
Thanks for listening!
Améliorez votre compréhension de This Day in AI Podcast avec My Podcast Data
Chez My Podcast Data, nous nous efforçons de fournir des analyses approfondies et basées sur des données tangibles. Que vous soyez auditeur passionné, créateur de podcast ou un annonceur, les statistiques et analyses détaillées que nous proposons peuvent vous aider à mieux comprendre les performances et les tendances de This Day in AI Podcast. De la fréquence des épisodes aux liens partagés en passant par la santé des flux RSS, notre objectif est de vous fournir les connaissances dont vous avez besoin pour vous tenir à jour. Explorez plus d'émissions et découvrez les données qui font avancer l'industrie du podcast.