Latest

6/recent/ticker-posts

Header Ads Widget

OpenAI hallucination benchmark 📚, Anthropic social bias study 📃, DeepMind Audio Generation 🔊

MeshRet has introduced a novel approach for improving motion retargeting for 3D characters that focuses on preserving body geometry interactions ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌  ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ 

TLDR

TLDR AI 2024-10-31

🚀

Headlines & Launches

Pushing the Frontiers of Audio Generation (6 minute read)

DeepMind has talked a little bit more about the audio generation models used to power NotebookLM.
OpenAI's new hallucination benchmark (7 minute read)

OpenAI has released the SimpleQA benchmark, which measures models' abilities around simple factual questions.
Evaluating feature steering: A case study in mitigating social biases (17 minute read)

This study explores using feature steering in AI models to interpretably modify outputs. It reveals a "steering sweet spot", where changes do not degrade capabilities. The study results show steering can alter social bias in targeted domains but also brings unexpected off-target effects. Further research is required to refine feature steering for safer, more reliable outcomes in AI models.
🧠

Research & Innovation

ThunderKittens 2 (17 minute read)

Thunder Kittens is a framework for writing extremely performant GPU Kernels. It is built on the idea that GPUs actually want to operate on small 16x16 tiles of data. In turn, the useability is quite high, and 40% faster kernels only take a few hundred lines of code.
Realistic Motion Retargeting (2 minute read)

MeshRet has introduced a novel approach for improving motion retargeting for 3D characters that focuses on preserving body geometry interactions from the start.
Better Generation with Self-Guidance Sampling (18 minute read)

Researchers have enhanced Masked Generative Models (MGMs) with a new self-guidance sampling method, improving their image generation quality while maintaining diversity.
🧑‍💻

Engineering & Resources

Speeding Up Transformers with Token Merging (GitHub Repo)

This project introduces PiToMe, an algorithm that compresses Vision Transformers by progressively merging tokens after each layer. This method reduces the number of tokens processed.
3D Reconstruction Without Pose Data (3 minute read)

PF3plat tackles the challenge of pose-free 3D reconstruction and novel view synthesis from RGB images, eliminating the need for extra data.
A Benchmark for Evaluating Data Curation Methods (GitHub Repo)

SELECT is the first large-scale benchmark for comparing data curation strategies in image classification. ImageNet++ is a new dataset that extends ImageNet-1K with five new training-data shifts, each assembled using different curation techniques.
🎁

Miscellaneous

Fine-tuning LLMs to 1.58bit: extreme quantization made easy (24 minute read)

BitNet, developed by Microsoft Research, introduces a transformer architecture that reduces LLM computational and memory requirements by using ternary precision (-1, 0, 1) equating to 1.58 bits per parameter. Models are required to be trained from scratch. BitNet can also fine-tune existing models to this low-precision format, maintaining strong performance on downstream tasks. This approach significantly reduces energy consumption and improves inference speed using specialized kernels for efficient matrix multiplication.
How we saved hundreds of engineering hours by writing tests with LLMs (7 minute read)

Assembled uses LLMs to accelerate and improve software testing, enabling test generation in minutes instead of hours. This approach increases engineering productivity, saving time and shifting focus to feature development. LLMs generate comprehensive and accurate tests that maintain code quality and development velocity.
25% of Smartphone Owners Don't Want AI as Apple Intelligence Debuts (6 minute read)

A CNET survey revealed that only 18% of smartphone users are motivated by AI features to upgrade their devices, with privacy and cost being significant concerns. Major manufacturers like Apple, Google, and Samsung are integrating more AI capabilities in their phones, yet many users prioritize battery life and storage over AI functions. AI subscriptions are set to become common, but nearly half of users are unwilling to pay for these features.

Quick Links

Rime AI achieves 100% API uptime in 2024 with Baseten (Sponsor)

Rime AI, who trains custom speech synthesis models with over 200 distinct voices, achieved <300 milliseconds p99 API latency with perfect uptime after switching to Baseten for model inference. Read Rime's story.
Google preps 'Jarvis' AI agent that works in Chrome (2 minute read)

Google's Project Jarvis, powered by Gemini 2.0, aims to automate web-based tasks in Chrome by using AI agents capable of reasoning and planning.
OpenAI's Whisper transcription tool has hallucination issues, researchers say (1 minute read)

Concerns have risen over OpenAI's Whisper introducing hallucinations in transcriptions, even in medical contexts.
Forerunner K2 humanoid robot can carry 33 lb in each dexterous hand (3 minute read)

Kepler has unveiled the Forerunner K2 humanoid robot, which has advanced AI, improved hardware, and enhanced vision and navigation systems for better real-time interaction.

Love TLDR? Tell your friends and get rewards!

Share your referral link below with friends to get free TLDR swag!
Track your referrals here.

Want to advertise in TLDR? 📰

If your company is interested in reaching an audience of AI professionals and decision makers, you may want to advertise with us.

If you have any comments or feedback, just respond to this email!

Thanks for reading,
Andrew Tan & Andrew Carr


If you don't want to receive future editions of TLDR AI, please unsubscribe from TLDR AI or manage all of your TLDR newsletter subscriptions.

Post a Comment

0 Comments