A
The internet may not be big enough for the LLMs.
That’s because LLMs need really good data to train off of, and while the internet has a lot of data, it also has a lot of the Livejournal posts I wrote in 2003 that no one should be training anything they want to be coherent on.
This Wall Street Journal piece explores the way AI companies on beginning to reckon with a potential shortage of data to train on. Apparently it could mean a lot fewer “do anything” enormous LLMs and a lot more models trained for specific tasks on specific data sets. The people using LLMs trained on my Livejournal probably appreciate that.
Follow topics and authors from this story to see more like this in your personalized homepage feed and to receive email updates.
Loading comments
Getting the conversation ready...
Most Popular
Most Popular
- Midjourney goes from generating cat images to full-body ultrasound scans
- Apple’s weird anti-nausea dots cured my car sickness
- Tim Cook says RAM expenses are ‘unsustainable’ and Apple is going to raise prices
- This Ghost in the Shell keyboard makes me want to activate the hundred spidery robot fingers inside my regular fingers
- Amazon employees say they’re facing termination for backing data center limits











