Mistral Small 3 (6 minute read) Mistral has released a very powerful 24B model that achieves strong performance, especially in multilingual data. It is the perfect size for deployment and strength. | Figure AI details plan to improve humanoid robot safety in the workplace (4 minute read) Figure AI is establishing the Center for the Advancement of Humanoid Safety to address gaps in safety for robots in workplaces. Led by former Amazon Robotics safety engineer Rob Gruendel, the initiative will focus on testing and certifying robots to industrial safety standards. The company aims to provide transparency with quarterly updates on testing processes and improvements. | | 3D Occupancy Prediction (9 minute read) SliceOcc introduces a novel vertical slice representation for 3D semantic occupancy prediction in dense indoor environments. It achieves state-of-the-art performance using an RGB camera-based model. | Explainable Query Optimization (21 minute read) Reqo is a new query optimization model that leverages Bi-GNN and probabilistic ML to improve cost estimation accuracy. It introduces an explainability technique that highlights the contribution of query subgraphs. | | Rigging Chatbot Arena Rankings (GitHub Repo) Researchers demonstrate that crowdsourced voting on Chatbot Arena can be manipulated to boost or lower model rankings using strategic rigging techniques, impacting the leaderboard's reliability. | Qwen2.5-VL Cookbooks (GitHub Repo) Qwen2.5-VL, an amazing new vision language model, has a companion set of cookbooks that show how to use the model for various different tasks. | | A New Way to Test AI for Sentience: Make It Confront Pain (6 minute read) Researchers from Google DeepMind and LSE conducted a study using a text-based game to explore AI "sentience," testing LLMs by having them choose between options with varying pain and pleasure associations. Findings revealed some models prioritized avoiding pain over scoring points, suggesting a potential framework for assessing AI consciousness. | AI's coding promises, and OpenAI's longevity push (6 minute read) The second wave of AI coding is advancing, allowing models to prototype, test, and debug code, potentially moving developers into more supervisory roles. OpenAI has entered longevity science with a model that designs proteins to transform cells into stem cells, claiming results surpassing human efforts. Cleaner jet fuels from alternative sources are gaining momentum, promising significant emission reductions and prompting industry shifts. | | Love TLDR? Tell your friends and get rewards! | Share your referral link below with friends to get free TLDR swag! | | Track your referrals here. | Want to advertise in TLDR? 📰 If your company is interested in reaching an audience of AI professionals and decision makers, you may want to advertise with us. Want to work at TLDR? 💼 Apply here or send a friend's resume to jobs@tldr.tech and get $1k if we hire them! If you have any comments or feedback, just respond to this email! Thanks for reading, Andrew Tan & Andrew Carr | | | |
0 Comments