DeepSeek-V3.2 (5 minute read) DeepSeek's V3.2 matches GPT-5, and its higher-compute V3.2-Speciale variant rivals Gemini-3.0-Pro and earned gold medals at IMO, IOI, and ICPC 2025. | | A Practical Approach to Verifying Code at Scale (7 minute read) OpenAI trained an agentic code reviewer for Codex since noisy safety tools are inevitably bypassed. The system now handles 100k+ external PRs daily, providing repo-wide context and execution access. Internally, it has caught launch-blocking bugs and protected high-stakes experiments. | The End of the Train-Test Split (13 minute read) Models don't know what is 'out of distribution' for themselves. Engineers will always be required in the loop until this problem is solved. It will never be easy to check the accuracy of models. Researchers need to look at their data, make sure it's clean, and label it correctly. | How Can Interpretability Researchers Help AGI Go Well? (32 minute read) The Google DeepMind mechanistic interpretability team has pivoted to a pragmatic approach to interpretability over the last year. It is important to have empirical feedback on goals with good proxy tasks. Near-complete understanding isn't required for significant impact. Good focused projects start with a theory of change, and good exploratory projects start with a robustly useful setting. | | STARFlow: Scalable Transformer Auto-Regressive Flow (Hugging Face Repo) STARFlow and STARFlow-V are state-of-the-art transformer autoregressive flow models for high-quality image and video generation. STARFlow introduces a novel transformer autoregressive flow architecture that combines the expressiveness of autoregressive models with the efficiency of normalizing flows. STARFlow-V is an end-to-end video generative model with normalizing flows. Examples of generated videos and comparisons are available. | Bridge Models for Image and Video Translation (GitHub Repo) ViBT introduces Vision Bridge Transformers, scaling Brownian Bridge Models to 20B parameters for efficient conditional generation. The models use a Transformer architecture and a variance-stabilized objective for robust performance on image and video editing tasks. | | ByteDance's TikTok Playbook Is Winning Consumer AI (5 minute read) ByteDance's Doubao app is now China's most popular mobile AI platform. It received more than 11.4 million downloads in October. Doubao's focus on frictionless AI-powered voice, image, and video experiences sets it apart from competitors. ByteDance keeps its most advanced technology proprietary, breaking away from China's open-source approach. This could give it a durable commercial edge. | | | Love TLDR? Tell your friends and get rewards! | | Share your referral link below with friends to get free TLDR swag! | | | | Track your referrals here. | | Want to advertise in TLDR? 📰 If your company is interested in reaching an audience of AI professionals and decision makers, you may want to advertise with us. Want to work at TLDR? 💼 Apply here or send a friend's resume to jobs@tldr.tech and get $1k if we hire them! If you have any comments or feedback, just respond to this email! Thanks for reading, Andrew Tan, Ali Aminian, & Jacob Turner | | | |
0 Comments