First real samples of Veo 3.1 generated videos (2 minute read) Veo 3.1 marks a noticeable step forward in prompt fidelity and visual/audio quality. It doesn't have the same issues as Veo 3, like occasional oddities with object proportions. The model demonstrates a better understanding of nuance, creating videos that closely match prompt intent. Traces of the model have appeared in Vertex AI and Google Vids. An official release is likely in the coming weeks. | Musk's xAI joins race to build 'world models' to power video games (4 minute read) xAI has joined Meta and Google in the race to develop AI systems that can navigate and design physical environments. The company is building world models with the goal of applying them in gaming. These models could be used to generate interactive 3D environments and unlock uses for AI beyond software and computers in physical products. Elon Musk announced on X that xAI would release a 'great' AI-generated game before the end of next year. | | Is AI a bubble? (22 minute read) Not yet, but it's close. GenAI is a demand-led, capital-intensive boom with revenues doubling annually and expected to hit $100 billion by 2026. Capital expenditure to revenue ratios (six times) are high, exceeding both railroads (two times) and telecom (four times) at their peaks, but there are other indicators: economic strain above 2% of GDP, industry strain with capex over 10x revenue, revenue growth slowing below 50% annually, price-to-earnings ratios hitting 50-60, or internal cash covering under 25% of capex. Until two of these are met, the current risk, while large, is manageable. | Why are embeddings so cheap? (32 minute read) Embeddings are a fundamental component of every modern retrieval augmented generation system. This post shows what computations are required to produce an embedding and how this can be used to calculate the true dollar cost of processing a token. The cost of processing a token is minuscule, and all embedding models converge to similar semantic representations, so no provider really enjoys a sustainable moat. This results in prices racing to the bottom as the underlying low costs are being passed directly on to consumers. | | Claude Code Plugins Now in Public Beta (3 minute read) Anthropic has introduced plugin support in Claude Code, allowing users to install custom slash commands, agents, MCP servers, and hooks with a single command. These modular extensions simplify setup sharing and enable flexible customizations for development workflows. | InferenceMAX (GitHub Repo) InferenceMAX continually re-benchmarks the world's most popular open-source inference frameworks and models to track real performance in real time. It captures progress in near real-time. A live dashboard is available for free. | ReasoningBank: Scaling Agent Self-Evolving with Reasoning Memory (1 minute read) ReasoningBank is a novel memory framework that distills generalizable reasoning strategies from an agent's self-judged successful and failed experiences. Agents can store and retrieve memories from ReasoningBank to inform their decisions. They can integrate new learnings and become more capable over time. The better memory enables more effective scaling. ReasoningBank consistently outperforms existing memory mechanisms that store raw trajectories or only successful task routines across web browsing and software engineering benchmarks. | Meta's Agent Learning (19 minute read) Meta has introduced "early experience," a training approach using data from an agent's own interactions without external rewards. It improves policy learning via implicit world modeling and self-reflection. | | Why America builds AI girlfriends and China builds AI boyfriends (15 minute read) A market scan of 110 AI companion platforms reveals stark differences: 52% are US-based, targeting young men with hypersexualized anime AI girlfriends monetized through NSFW premium features. Chinese platforms target adult women aged 25-40 with dynamic AI boyfriends featuring card-collecting mechanics and WeChat integrations, shaped by China's marriage crisis (20% drop in 2024) and gender imbalance. | The ChatGPT App Store Moment (2 minute read) OpenAI just launched apps inside ChatGPT where you can say "Spotify, make me a playlist" or "DoorDash, order my usual" and it handles everything conversationally. This looks like early iPhone web apps, not true native experiences yet, but the trajectory is clear. As OpenAI expands its SDK to give developers access to user memory, persistent state, and action execution, we'll see actual native ChatGPT applications designed for conversational interfaces. | | | Love TLDR? Tell your friends and get rewards! | | Share your referral link below with friends to get free TLDR swag! | | | | Track your referrals here. | | Want to advertise in TLDR? 📰 If your company is interested in reaching an audience of AI professionals and decision makers, you may want to advertise with us. Want to work at TLDR? 💼 Apply here or send a friend's resume to jobs@tldr.tech and get $1k if we hire them! If you have any comments or feedback, just respond to this email! Thanks for reading, Andrew Tan, Ali Aminian, & Jacob Turner | | | |
0 Comments