Perplexity launches Perplexity Health agent in US (2 minute read) Perplexity has launched Perplexity Health in the US, entering the competitive consumer health AI space. It differentiates with a customizable hub and specialized AI agents, like nutrition and sleep assistants. Perplexity's strategy mirrors its finance segment, integrating real user data and leveraging AI for personalized insights. | | Scaling Karpathy's Autoresearch: What Happens When the Agent Gets a GPU Cluster (12 minute read) Researchers pointed Claude Code at autoresearch and gave it access to 16 GPUs on a Kubernetes cluster. Over 8 hours, it submitted around 910 experiments. The parallelism changed how the agent searched - with one GPU, it was stuck doing greedy hill-climbing, but with 16 GPUs, it ran factorial grids of 10 to 13 experiments per wave, catching interaction effects between parameters that sequential search would miss. | OpenClaw Was the WordPress Moment (3 minute read) OpenClaw showed the promise of autonomous AI agents but failed in production due to context management, non-determinism, weak tooling, and model limitations. Fully agentic systems are fragile, with real-world use cases relying on structured workflows plus LLM steps rather than pure autonomy. Next wave shifts to vertical "agent harnesses" that package domain-specific context, reliability, and infra into usable products. | World Models: Computing the Uncomputable (96 minute read) World Models are advancing AI by simulating real-world complexities through action-conditioned neural networks, enabling efficient prediction and planning. Generative and latent approaches to World Models are driving breakthroughs across various applications, from robotics to autonomous driving, leveraging vast datasets like gaming clips to approximate human decision-making. Major investments from industry leaders, including companies like General Intuition and World Labs, underscore the significant potential and rapid evolution of World Models in achieving general intelligence. | | Agent Auth Protocol (Website) The Agent Auth protocol makes the run-time agent a first-class principal. Each agent is registered with its own identity, granted specific capabilities, and governed by a lifecycle that the server controls. The server sees which agents are acting, what they are authorized to do, and can terminate agents without affecting anything else. The protocol is designed for existing infrastructure and is easy to adopt. | NanoGPT Slowrun: 10x Data Efficiency with Infinite Compute (7 minute read) Researchers achieved 10x data efficiency with NanoGPT Slowrun, a benchmark for language modeling algorithms in the infinite compute, within a few weeks. Data efficiency matters because compute grows much faster than data. Intelligence will eventually be bottlenecked by data, not compute. This data efficiency result allows researchers to improve model performance by scaling with compute rather than with data. | MolmoPoint from AI2 (5 minute read) MolmoPoint is a new open-source grounding architecture with three models, a 36K-image GUI dataset, and new tracking data for video and interface tasks. | | Broad Timelines (24 minute read) AI timelines remain uncertain, with experts divided on when AI will significantly change the world. The best approach isn't to pick a specific year, but to work with broad timelines that account for diverse expert opinions. Decision-makers should hedge against both short and long-term scenarios, investing in strategies adaptable to rapid changes or gradual shifts in AI development. | Rethinking open source mentorship in the AI era (7 minute read) AI contributions have increased noise in open source, making traditional mentorship signals unreliable. GitHub has introduced the '3 Cs' framework (Comprehension, Context, and Continuity) to help maintainers identify worthy contributors for mentorship. This strategy aids in maintaining mentorship efficacy and ensuring the growth and sustainability of open source communities amidst AI advancements. | Jensen Huang doesn't need a new chip. He needs a new moat (6 minute read) Nvidia's CEO Jensen Huang is aiming to transform the company from a chipmaker to a dominant AI platform operator through NemoClaw, an open-source platform for AI agents. This move minimizes reliance on chip sales cycles and positions Nvidia for more enduring, platform-based growth. However, success hinges on enterprise adoption and outpacing rapid advancements from Chinese competitors. | | | Love TLDR? Tell your friends and get rewards! | | Share your referral link below with friends to get free TLDR swag! | | | | Track your referrals here. | | | |
0 Comments