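This episode takes an in-depth look at Z-Image, an efficient image generation foundation model from Alibaba Group. We discuss how the model, with 6 billion parameters, challenges the "scale at any cost" paradigm, achieving top-tier image generation and editing through a carefully built data infrastructure, a novel single-stream diffusion Transformer architecture, an optimized training strategy, and an efficient inference scheme. The episode also covers Z-Image-Turbo's sub-second inference speed and compatibility with consumer-grade hardware, as well as Z-Image-Edit's strong instruction-following editing. In a comprehensive performance evaluation, Z-Image matches or surpasses leading closed-source and open-source models across multiple dimensions, especially in photorealistic image generation and ...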
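This episode takes an in-depth look at how Code2Video, a code-centric agent framework, changes the way educational videos are generated. We break down its three collaborating agents (Planner, Coder, and Critic) and introduce the new MMMC benchmark and the TeachQuiz evaluation metric, showing the potential of AI to generate high-quality, interpretable, and controllable educational content. Join us as we explore how this technology moves beyond traditional pixel-level generation and what it could mean for future learning experiences.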
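In this episode, we take an in-depth look at Krea Realtime 14B, a 14-billion-parameter AI model for real-time long-form video generation. We explain how it overcomes the limitations of existing real-time video models to reach text-to-video generation at 11 fps, and what that means for interactive creative tools. We walk through its core "Self-Forcing" distillation technique and its solutions to "exposure bias" and the challenges of long video generation, then look ahead to where the technology is going.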
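Welcome to AI Radio FM - Technology Channel! In this episode, we take an in-depth look at SANA-Video, a small diffusion model that runs efficiently on an RTX 5090 GPU and generates 720p high-resolution videos up to one minute long. We explain its two core innovations, a linear Diffusion Transformer and block linear attention with a constant-memory KV cache, and how it achieves speeds 16 times faster than existing SOTA models, with strong quality, at very low training cost. From training strategy to real-time deployment, SANA-Video is redefining efficiency and accessibility in video generation.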
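An in-depth look at Qwen3-VL, a vision-language model that marks a significant advance in multimodal AI. We cover its performance, including pure-text understanding, 256K long-context processing, and advanced multimodal reasoning, along with its architectural upgrades and carefully designed training strategy. Qwen3-VL not only performs well across benchmarks but is also positioned as a foundation for future embodied intelligence, agentic decision-making, and multimodal code intelligence.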
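In this episode, we take an in-depth look at "Nested Learning," a new paradigm that reframes how we understand deep learning models and their training process. It reveals the "context flow compression" mechanism at work in existing deep learning methods and leads to techniques such as deep optimizers, self-modifying models, and continuum memory systems. We close with the HOPE architecture and its strong results on language modeling tasks.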
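In this episode, we take an in-depth look at the Step-Audio-R1 model and how it breaks through the audio domain's long-standing "reasoning dilemma" to achieve deep reasoning over audio for the first time. We explain its Modality-Grounded Reasoning Distillation (MGRD) framework and its strong performance in speech understanding, environmental sound analysis, and music appreciation, and discuss how it surpasses existing top models and opens a new chapter for multimodal reasoning systems.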
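An in-depth look at ParaS2S, a collaboration between ByteDance and National Taiwan University: a benchmark and reinforcement-learning alignment framework designed to improve the paralinguistic awareness of speech-to-speech (S2S) models, covering cues such as emotion, tone, and speaker attributes. We examine the "tone-deaf" problem in existing S2S models and show how ParaS2S significantly improves both content and style matching while greatly reducing annotation cost, moving S2S interaction toward something more natural and human.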
How this podcast ranks in the Apple Podcasts, Spotify and YouTube charts.
Apple Podcasts | #104 |
AI Podcast launched a year ago and has published 413 episodes to date.