Subscribe
Sign in
Home
Chat
Articles
The Update
The Gradient Podcast
Archive
About
Latest
Top
Discussions
Update #73: Against Language Erasure and Better Long-Context Benchmarking
Researchers and governments develop data sets and technology for local languages, and NVIDIA presents a new, more thorough benchmark for long-context…
Apr 23
•
Cole Frank
,
Jaymee Sheng
, and
daniel bashir
8
Share this post
Update #73: Against Language Erasure and Better Long-Context Benchmarking
thegradientpub.substack.com
Copy link
Facebook
Email
Note
Other
Mini-Update #39: 2024 AI Index Report and Autonomous Driving Detection
Stanford publishes its most comprehensive AI index report and researchers develop new methods of driving lane detection using sparse anchors.
Apr 16
•
Ather Fawaz
and
Jonathan Xue
8
Share this post
Mini-Update #39: 2024 AI Index Report and Autonomous Driving Detection
thegradientpub.substack.com
Copy link
Facebook
Email
Note
Other
Update #72: NYC's Unfortunate Chatbot and Mixture-of-Depths
NYC's My City Chatbot makes concerning suggestions to users and DeepMind introduces a novel method for optimizing the resource allocation in transformer…
Apr 9
•
daniel bashir
,
Sharut Gupta
, and
Justin Landay
8
Share this post
Update #72: NYC's Unfortunate Chatbot and Mixture-of-Depths
thegradientpub.substack.com
Copy link
Facebook
Email
Note
Other
Mini-Update #38: OpenAI's Voice Engine and LDM Scaling
OpenAI previews its Voice Engine but does not release it and Latent Diffusion Models (LDMs) show powerful capabilities at small size.
Apr 5
•
Ather Fawaz
and
Jonathan Xue
Share this post
Mini-Update #38: OpenAI's Voice Engine and LDM Scaling
thegradientpub.substack.com
Copy link
Facebook
Email
Note
Other
March 2024
Update #71: Neuralink's First Human Trial and Controlling Language Model Hallucinations
Noland Arbaugh plays chess with his mind, and researchers find training strategies that reduce language model hallucinations.
Mar 26
•
Justin Landay
,
Sharut Gupta
, and
daniel bashir
11
Share this post
Update #71: Neuralink's First Human Trial and Controlling Language Model Hallucinations
thegradientpub.substack.com
Copy link
Facebook
Email
Note
Other
4
Mini-Update #37: xAI's Open Source Grok and Apple's MM1
xAI open sources its LLM Grok and Apple releases MM1 – a series of multimodal LLMs and the lessons learned from training them.
Mar 21
•
Ather Fawaz
and
Jonathan Xue
1
Share this post
Mini-Update #37: xAI's Open Source Grok and Apple's MM1
thegradientpub.substack.com
Copy link
Facebook
Email
Note
Other
Mini-Update #36: Claude 3 and 1-bit LLMs
Anthropic releases its most powerful chatbot yet, and researchers release an LLM that uses ternary values for its parameters.
Mar 13
•
Ather Fawaz
and
Jonathan Xue
2
Share this post
Mini-Update #36: Claude 3 and 1-bit LLMs
thegradientpub.substack.com
Copy link
Facebook
Email
Note
Other
Update #70: Apple Shutters Autonomous EV Project and Griffin + Hawk Compete with Transformers
Apple terminates Project Titan and redirects resources to generative AI, and researchers propose two RNN-based architectures in the SSM line of work…
Mar 12
•
daniel bashir
and
Sharut Gupta
12
Share this post
Update #70: Apple Shutters Autonomous EV Project and Griffin + Hawk Compete with Transformers
thegradientpub.substack.com
Copy link
Facebook
Email
Note
Other
February 2024
Update #69: Gemini Overcompensates for Bias and Missing Details in Sora
Gemini sparks AI culture wars and OpenAI releases a state-of-the-art video generation model.
Feb 27
•
daniel bashir
,
Justin Landay
, and
Sharut Gupta
10
Share this post
Update #69: Gemini Overcompensates for Bias and Missing Details in Sora
thegradientpub.substack.com
Copy link
Facebook
Email
Note
Other
1
Mini-Update #35: Sora and Gemini 1.5 Announcements
OpenAI releases a new text-to-video model, and Google's Gemini 1.5 boasts an extremely large context window.
Feb 25
•
Ather Fawaz
and
Jonathan Xue
2
Share this post
Mini-Update #35: Sora and Gemini 1.5 Announcements
thegradientpub.substack.com
Copy link
Facebook
Email
Note
Other
Update #68: Whispering Indigenous Languages and Neural Net Training Dynamics
Papa Reo explains issues with Whisper's ability to transcribe the Māori language, and researchers find that neural networks learn statistics of…
Feb 13
•
daniel bashir
and
Justin Landay
12
Share this post
Update #68: Whispering Indigenous Languages and Neural Net Training Dynamics
thegradientpub.substack.com
Copy link
Facebook
Email
Note
Other
Mini-Update #34: Taylor Swift Deepfakes and Efficient Exploration for LLMs
Explicit AI-generated images of Taylor Swift spread across the internet, and DeepMind researchers propose a method for efficiently and effectively…
Feb 7
•
Ather Fawaz
and
Jonathan Xue
1
Share this post
Mini-Update #34: Taylor Swift Deepfakes and Efficient Exploration for LLMs
thegradientpub.substack.com
Copy link
Facebook
Email
Note
Other
Share
Copy link
Facebook
Email
Note
Other
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts