Fool Time - The birth of the internet, according to Jon Bois

P03 Locke@lemmy.dbzer0.com · 3 days ago

Citizenship isn’t free. You don’t get to have your cake and eat it, too.

P03 Locke@lemmy.dbzer0.com · 5 days ago

I don’t understand why Oracle or the reporter tried to invent a new term for this. This is literally “colocation”, a term so ubiquitous that the industry typically shortens it to “colo”.

P03 Locke@lemmy.dbzer0.com · 5 days ago

No study on caffeine wants to acknowledge the elephant in the room: Half the population has undiagnosed ADHD, and people use caffeine to self-medicate, usually unaware of why they are doing it.

P03 Locke@lemmy.dbzer0.com · 5 days ago

Starbucks uses local imports. The biggest problem is that they burn the shit out of the beans to normalize it down to the same flavor. So, the big appeal of using locally-sourced beans is wiped away by the way the need for a consistent “flavor”.

That’s why Starbucks coffee tastes like shit.

P03 Locke@lemmy.dbzer0.com · 5 days ago

P03 Locke@lemmy.dbzer0.com · 5 days ago

deleted by creator

P03 Locke@lemmy.dbzer0.com · 5 days ago

It’s a six-month decomm of Claude.

P03 Locke@lemmy.dbzer0.com · 5 days ago

Posting generational memes from 2016? That’s a bold choice.

P03 Locke@lemmy.dbzer0.com · edit-2 5 days ago

sysctl user.legal_bullshit.pretend_age_quote_verification_unquote=99

Watch that land on distros everywhere.

P03 Locke@lemmy.dbzer0.com · 5 days ago

Mark Twain started this nonsense.

P03 Locke@lemmy.dbzer0.com · edit-2 5 days ago

When they can argue its for “transformative use” or whatever the magic words are? Thats technically fair use in US law.

Well, considering they transformed its use to about 250GB of weights, that would qualify. That’s at least thousands of times less than the size of the books they downloaded, so you can’t really claim “they downloaded the books and put it into the model unaltered”.

It’s not like you can ask one of the models for page 156 of the second Harry Potter book, unless it’s cheating and attached to a search engine to try to find the result. There is no compression technique that can take something to a thousandth of its size without an substantial loss. You can, however, ask it to summarize what happened in the second Harry Potter book, including what the actual title is, without it trying to look it up on its own.

The AI bros might have a serious point within the law, and that should scare actual artists. It should also scare studios like Disney that hold a fuck ton of “intellectual property”.

Actual artists have been fucked over by copyright since its invention. Copyright, patents, and intellectual rights were created under the false pretense that it “protects the little person”, but these are lies told by the rich and powerful to keep themselves rich and powerful. Time and time again, we have seen how broken the patent system is, how it is impossible to not step on musical copyright, how Mark Twain, Sonny Bono, and Disney has extended copyrights to forever, and how the megacorporations have way more money than everybody else to defend those copyrights and patents. These people are not your friend, and their legal protections are not for you.

If the rich end up dismantling their own IP shield that has existed to enrich themselves for centuries in the name of AI progress, I’m going to call that a win.

P03 Locke@lemmy.dbzer0.com · 12 days ago

Meh, go play Terraria. It has less baggage.

P03 Locke@lemmy.dbzer0.com · 12 days ago

At my work, anybody can have Windows, Mac, or Linux. Each have an approved set of software that they can use. If it’s not on the approved list, and it’s something freely available and gets regular security updates, it’s usually not a problem to get it on the list.

I don’t have to explain to my co-workers what software I use. Most of the time, it’s cloud-based or web-based and universal, anyway.

P03 Locke@lemmy.dbzer0.com · 12 days ago

Didn’t someone at Google write a memo that was like “we’re kinda fucked b/c you can re-create this stuff with enough resources” like 2 years ago?

Basically, yes. They were specifically decrying the amount of open-sourcing they and their American competitors were doing, because capitalism, of course. Around this time, we had examples like StabilityAI’s StableDiffusion and Meta’s LLaMA as open-source models. And around this time, everybody else started closing their models, despite the fact that the research kept on going out in the open. StabilityAI kept their models open, mostly because they had no choice, but the attitude shifted towards profitability.

So, China took the open-source mantle, and these open/closed lines are being drawn strictly around national divisions as this American vs. China slant. Which is mostly a diversion of the real battle.

P03 Locke@lemmy.dbzer0.com · 12 days ago

Whoever wrote this article didn’t even bother to do the most basic of research.

DeepSeek fully admitted they started with ChatGPT outputs to train its model. And then they released it as an open-source model, so that everybody else can “steal” their work. On the image/video front, the general public has created every possible variation on top of every model you can think of. On top of that, any model that has ever been released with full weights has been spun into whatever variation or VRAM size you want.

The ugly truth that the American companies want to hide is the fact that they are spending trillions of dollars on an oligopoly that they can’t keep long-term. They hope that they can just keep spending more money to add more billions of parameters to their models, and keep technologically competitive with the secondary open-source models. But, they’ve already ran into diminishing returns over a year ago, and the global compute sector physically cannot keep up with demand for another cycle of even more diminishing returns.

The other factor is that realistic miniaturization of models is already here. Some of the smaller sizes aren’t as effective as the 250GB models they use on cloud-based services, but you can still do a lot with a 16GB or 24GB video card, using models of those sizes. Optimization and LLM quantization is getting better and better each year. The AI bubble burst is going to force a cascade shift into a new era of localization. Everybody is sick to fucking death of renting and subscribing to everything. Us pirates already do so on the media front, and soon localization of LLMs is going to become way more popular.

The question isn’t “Can people steal the tech?”. It’s “how long will people notice that it’s already happening?”