cover of episode #198 - DeepSeek R1 & Janus, Qwen1M & 2.5VL, OpenAI Agents

#198 - DeepSeek R1 & Janus, Qwen1M & 2.5VL, OpenAI Agents

2025/2/2
logo of podcast Last Week in AI

Last Week in AI

AI Deep Dive AI Chapters Transcript
People
A
Andrey Kurenkov
J
Jeremie Harris
Topics
@Andrey Kurenkov : 我认为我们之前的播客对DeepSeek v3的预测是准确的,DeepSeek R1的结果并不令人意外。DeepSeek R1是一个与OpenAI的O1具有竞争力的语言模型,其优势在于推理能力。该模型的训练使用了强化学习方法,并取得了令人印象深刻的成果。DeepSeek R1的发布引发了美国科技股的剧烈波动,这反映了市场对AI技术发展前景的担忧和期待。然而,我认为市场对DeepSeek R1对英伟达的影响存在误读,它实际上利好英伟达的硬件生态系统。DeepSeek R1采用宽松的MIT许可证,这有利于其在商业和研究领域的应用。 此外,DeepSeek还发布了Janus Pro,一个性能优异的开源文本到图像模型。这些模型的发布表明,DeepSeek作为一个实验室,正在对开源AI领域产生重大影响。 @Jeremie Harris : DeepSeek V3是一个强大的基础模型,通过强化学习优化就能达到与GPT-4相当的水平。人们对DeepSeek R1对硬件的影响存在误读,它实际上利好英伟达的硬件生态系统。仅仅通过奖励模型正确答案就能有效提升大型语言模型的推理能力,这证明了强化学习的强大潜力。深度学习模型通过强化学习,能够自主发现并利用推理时间缩放定律,这表明该定律是AI系统的一个内在属性。模型会自然地采用比人类更有效率的推理方式,人类可解释性只是对模型的一种额外限制。DeepSeek R1是实际应用的模型,而R1.0则展示了强化学习的未来潜力。DeepSeek证明了可以以更低的成本获得与OpenAI O1相当的性能,这对于英伟达来说是利好消息。DeepSeek的成功凸显了算力在AI发展中的重要性,也进一步强调了出口管制的必要性。DeepSeek的手机应用在Google Play商店排名第一,这表明其模型获得了广泛的关注。DeepSeek的成功并不能改变算力在AI发展中的核心地位,未来算力仍然是决定AI竞争力的关键因素。

Deep Dive

Shownotes Transcript

Our 197th episode with a summary and discussion of last week's big AI news!
Recorded on 01/17/2024

Join our brand new Discord here! https://discord.gg/nTyezGSKwP

Hosted by Andrey Kurenkov and Jeremie Harris.
Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai

Read out our text newsletter and comment on the podcast at https://lastweekin.ai/.

In this episode:

- DeepSeek releases R1, a competitive AI model comparable to OpenAI’s O1, leading to market unrest and significant drops in tech stocks, including a 17% plunge in NVIDIA's stock. 
 - OpenAI launches Operator to facilitate agentic computer use, while facing competition from new releases by DeepSeek and Quen, with applications seeing rapid adoption.
 - President Trump revokes the Biden administration's executive order on AI, signaling a shift in AI policy and deregulation efforts.
 - Taiwanese government clears TSMC to produce advanced 2-nanometer chip technology abroad, aiming to strengthen global semiconductor supply amidst geopolitical tensions.

If you would like to become a sponsor for the newsletter, podcast, or both, please fill out this form.

Timestamps + Links: