diz, diz@awful.systems

Instance: awful.systems
Joined: 3 years ago
Posts: 8
Comments: 99

RSS feed

Posts and Comments by diz, diz@awful.systems

And the other "nuanced" take, common on my linkedin feed, is that people who learn how to use (useless) AI are gonna replace everyone with their much increased productive output.

Even if AI becomes not so useless, the only people whose productivity will actually improve are the people who aren't using it now (because they correctly notice that its a waste of time).


That philosophy always ends in stepping into dogshit to try to boost stock prices.


When they tested on bugs not in SWE-Bench, the success rate dropped to 57‑71% on random items, and 50‑68% on fresh issues created after the benchmark snapshot. I’m surprised they did that well.

After the benchmark snapshot. Could still be before LLM training data cut off, or available via RAG.

edit: For a fair test you have to use git issues that had not been resolved yet by a human.

This is how these fuckers talk, all of the time. Also see Sam Altman's not-quite-denials of training on Scarlett Johansson's voice: they just asserted that they had hired a voice actor, but didn't deny training on actual Scarlett Johansson's voice. edit: because anyone with half a brain knows that not only did they train on her actual voice, they probably gave it and their other pirated movie soundtracks massively higher weighting, just as they did for books and NYT articles.

Anyhow, I fully expect that by now they just use everything they can to cheat benchmarks, up to and including RAG from solutions past the training dataset cut off date. With two of the paper authors being from Microsoft itself, expect that their "fresh issues" are gamed too.


Yeah I'm thinking that people who think their brains work like LLM may be somewhat correct. Still wrong in some ways as even their brains learn from several orders of magnitude less data than LLMs do, but close enough.


You can film with an actual camera then use video to video to make it look very AI. If you're just grifting, that would be the way to go I think.


They're also very gleeful about finally having one upped the experts with one weird trick.

Up until AI they were the people who were inept and late at adopting new technology, and now they get to feel that they're ahead (because this time the new half-assed technology was pushed onto them and they didn't figure out they needed to opt out).


I was writing some math code, and not being an idiot I'm using an open source math library for doing something called "QR decomposition", and its efficient, and it supports sparse matrices (matrices where many numbers are 0), etc.

Just out of curiosity I checked where some idiot vibecoder would end up. AI simply plagiarizes from some shit sample snippets which exist purely to teach people what QR decomposition is. It's actually unusable, due to being numerically unstable.

Who in the fuck even needs this shit to be plagiarized, anyway?

It can't plagiarize a production quality implementation, because you can count those on the fingers of one hand, they're complex as fuck and you can't just blend a few together to try to pretend you didn't plagiarize.

The answer is, people who are peddling the AI. They are the ones who ordered plagiarism with extra plagiarism on top. These are not coding tools, these are demos to convince the investors to buy the actual product, which is company's stock. There's a little bit of tool functionality (you can ask them to refactor the code), but it's just you misusing a demo to try to get some value out of it.

And to that end, the demos take every opportunity to plagiarize something, and to talk about how the "AI" wrote the code from scratch based on its supposed understanding of fairly advanced math.

And in coding, it is counter productive to plagiarize. Many of the open source libraries can be used in commercial projects. You get upstream fixes for free. You don't end up with some bugs or worse yet security exploits that may have been fixed since the training cut-off date.

No fucking one in the right mind would willingly want their product to contain copy pasted snippets from stale open source libraries, passed through some sort of variable-renaming copyright laundering machine.

Except of course the business idiots who are in charge of software at major companies, who don't understand software. Who just failed upwards.

They look at plagiarized lines and count them as improved productivity.



Its also interesting that this is the most conservative, pro “its not just memorizing” estimation possible : they multiplied the probabilities of consequent tokens. Basically it means if it starts shitting out a quote it will not be able to stop quoting until their anti copy the whole book finetuning kicks in after 50 words or so.

It can probably output far more under a realistic test (always picking the top token, temperature =0)


If it was a basement dweller with a chatbot that could be mistaken for a criminal co-conspirator, he would've gotten arrested and his computer seized as evidence, and then it would be a crapshoot if he would even be able to convince a jury that it was an accident. Especially if he was getting paid for his chatbot. Now, I'm not saying that this is right, just stating how it is for normal human beings.

It may not be explicitly illegal for a computer to do something, but you are liable for what your shit does. You can't just make a robot lawnmower and run over a neighbor's kid. If you are using random numbers to steer your lawnmower... yeah.

But because it's OpenAI with 300 billion dollar "valuation", absolutely nothing can happen whatsoever.


In theory, at least, criminal justice's purpose is prevention of crimes. And if it would serve that purpose to arrest a person, it would serve that same purpose to court-order a shutdown of a chatbot.

There's no 1st amendment right to enter into criminal conspiracies to kill people. Not even if "people" is Sam Altman.


It's curious how if ChatGPT was a person - saying exactly the same words - he would've gotten charged with a criminal conspiracy, or even shot, as its human co-conspirator in Florida did.

And had it been a foreign human in the middle east, radicalizing random people, he would've gotten a drone strike.

"AI" - and the companies building them - enjoy the kind of universal legal immunity that is never granted to humans. That needs to end.


If I pirated a book and wrote a review of it, would that make the review copyright infringement?

That would leave pirating the book be copyright infringement. But if you used AI (trained on the pirated book) to write a review, then pirating the book wouldn't be copyright infringement, at least according to this judge.

Re Anthropic, their ruling was that downloading itself was copyright infringement, regardless of whether Anthropic distributed works while torrenting them.

In Meta's case the plaintiffs will likely find it impossible to demonstrate that the data which Meta uploaded contained specifically their books. Maybe all the other torrent users who were downloading that torrent were downloading other parts of it.


I appreciate the sentiment but I also hate the whole "AI is a power loom for coding".

The power loom for coding is called "git clone".

What "AI" (LLM) tools provide is just English as a programming language with plagiarized sum total of all open source as the standard library. English is a shit programming language. LLMs are shit at compiling it. Open source is awesome. Plagiarized open source is "meh" - you can not apply upstream patches.


It's not about ceding victory, it's about whether we accept shit talking plaintiff's lawyers as an adequate substitute for a slap on the wrist, or not. Clearly the judge wants to appear impartial.

Plaintiff made a perfectly good argument that meta downloaded the books illegally, and that this downloading wasn't necessary to enable a (fair or not) use. A human critic does not get a blanket license to pirate any work he might want to criticize, even though critique is fair use.


Thought about it some more, the most charitable I can get is that Meta's judge thinks someone else could win the case if they have a specific book that was torrented and then they point at the general situation with AI slop in bookstores and argue that AI harms book sales.

I can not imagine that working. At all. So the AI is producing slop of infinitesimally higher quality because it was trained on a pirated copy of your book in particular. Clearly the extra harm to specifically your business due to piracy of specifically your books would be rather small, as this very judge would immediately point out. In fact the AI slop is so shit that people only buy it by mistake, so its quality doesn't really matter.

Maybe news companies could sometimes win lawsuits like this, but book authors, no way.

I think it is just pure copium to see this ruling in any kind of positive light. Alsup (misanthropic's judge) at least was willing to ding an AI company for pirating books (although he was probably only willing to ding them for that because it wouldn't be fatal to them the way it would be to Meta). This guy wouldn't even do that bare minimum.

And the whole approach is insane. You can't make a movie without getting a movie rights contract with the author. A movie adaptation of a book is far more transformative than anything AI does. Especially the "training" which is just fucking gradient descent, you nudge a bunch of numbers towards replicating the works, over and over again, in a purely mechanical process.

Nobody ever had to successfully argue that movie sales harm book sales just to treat movie adaptations as derivative work.


But he’s saying that plaintiffs need to demonstrate said disruption, to even get to the jury.

He said:

In cases involving uses like Meta’s, it seems like the plaintiffs will often win, at least where those cases have better-developed records on the market effects of the defendant’s use

And what do those records are supposed to look like? Harm has to be specific, always had been. How do you ever demonstrate that a specific AI harmed the market for a specific book?

I honestly think both judges had to try to appear neutral and Meta’s had to work harder to appear neutral because Meta’s conduct was worse, hence a more bizarre argument. Misanthropic can be slapped with a small fine.


So, the judge says:

In cases involving uses like Meta’s, it seems like the plaintiffs will often win, at least where those cases have better-developed records on the market effects of the defendant’s use.

And what is that supposed to ever look like? Do authors need a better developed record of effects of movies on book sales, to get paid for movie adaptations, too?


As somewhat of an author I fucking can't understand how.

To win, they'd need to demonstrate specific harms (from specific infringer, to specific book), the "amazon is full of slop" won't do.

It's like someone makes a movie without licensing from the book author, and then the judge says that authors must argue that movies harm book sales.

edit: except much much worse because good luck pointing at specific instances of slop and connecting them to specific ai and its training on a specific work. At least with a movie you can point at a movie and at the book its made from.

edit: frankly the whole thing just sounds like both of the judges had to sound neutral, and because Meta's conduct was more egregious the judge had to write weirder stuff to sound neutral.

Anthropic's judge can simply slap them with (likely insignificant) fines in the light of theirs displaying "good faith" by buying a bunch of books legally. Meta's judge had to invent whole new theory of unfair use that plaintiff lawyers can't possibly support with evidence.



RSS feed

Posts by diz, diz@awful.systems

Comments by diz, diz@awful.systems

Oh, by far. There’s only 80 decimal places in that at most.

It got to be a quantum sweatshop: a quantum computer for AGI (a guy instead)


How much does he think an engineer spends on CAD tools, anyway? Altium is like, what, $2500 / year? Very “how much can a banana cost”.

It’s all capital costs for tools, pretty much, anyway, maybe CAD should start charging per net lmao.


Oh they are going to charge per token for github copilot? That thing is a money waste for everyone, I’m pretty sure. I get a mix of inane mildly good suggestions, irrelevant stuff, and an occasional suggestion of super evil sabotage. Due to mild OCD about issues, I tend to have to fix said mildly good suggestions, but from the objective perspective that nitpickery is not worth it, everything was fine without, we had compiler warnings, coverity, etc.

edit: the difference being that the old stuff was deterministic and you just ran it on the whole codebase and had it pass. Unlike gh copilot that’ll just make up new shit. And as for the times it caught some bad bug that you made… add more tests instead.


I wouldn’t be too surprised if they really don’t, they’re just advertising the advertising lol.

edit: Basically what if you spent a trillion dollars so that you could beam ads to people’s bathroom mirrors. And better yet, ads reflected from water down in their toilets. Then in the interest of expediency you just take random ads and put them there for free, and your actual product, shares, sells better.


It makes every bad programmer into a 10x bad programmer (equivalent to 10 bad programmers).


I’m afraid they already had that exact idea when they named the startup “oklo”.


I think it’s not very difficult to construct a really shitty small reactor that is horrendously expensive per watt. Can probably be built in a year if you get rid of NRC and just half ass it completely.

I mean, Demon Core was a small reactor. You pretty much have to do a lot of work to ensure you won’t create a small reactor when a truckload of fresh fuel falls into a river.

What’s difficult is making a safe reactor that is actually making electricity at somewhat reasonable price per watt.



Curiously enough you also have those radon spas in old uranium mines. I don’t think hinging their luck on “a little radiation is good for you” worked for anyone.


Shorting the market requires precise timing. Being early is just as bad as being wrong.

Exactly. It is not enough to know that a company stock will go down. It is necessary to know that it will never go higher than a certain point above the current value (not even momentarily) before it goes down. If you have a fuckload of other people’s money you can just keep double-or-nothing-ing it, that’s what banks were doing to gamestop, except that this can sometimes cause the stock to go even higher (a short squeeze), which would make you (who doesn’t actually have a fuckload of other people’s money) lose all of your money.

edit: also the other concerning possibility is that stock prices can go up simply due to the dollar going down.


The only thing that is allowed to tell good art from slop is the AI which needs to consume good art and not slop.



Hyping up AI is bad, so it’s alright to call someone a promptfondler for fondling prompt.

I mostly see "clanker" in reference to products of particularly asinine promptfondling: spambot "agents" that post and even respond to comments, LLM-based scam calls, call center replacement, etc.

These bots don't derive their wrongness from the wrongness of promptfondling, these things are part of why promptfondling is wrong.

Doesn’t clanker come from some Star Wars thing where they use it like a racial slur against robots, who are basically sapient things with feelings within its fiction? Being based on “cracker” would be alright,

I assume the writers wanted to portray the robots as unfairly oppressed, while simultaneously not trivializing actual oppression of actual people (the way "wireback" would have, or I dunno "cogger" or something).

but the way I see it used is mostly white people LARPing a time and place when they could say the N-word with impunity.

Well yeah that would indeed be racist.

I’m seeing a lot of people basically going “I hate naggers, these naggers are ruining the neighborhood, go to the back of the bus nagger, let’s go lynch that nagger” and thinking that’s funny because haha it’s not the bad word technically.

That just seems like an instance of good ol anti person racism / people trying to offend other people while not particularly giving a shit about the bots one way or the other.


we should recognize the difference

The what now? You don't think there's a lot of homophobia that follows "castigating someone for what they do" format, or you think its a lot less bad according to some siskinded definition of what makes slurs bad that somehow manages to completely ignore anything that actually makes slurs bad?

I think that’s the difference between “promptfondler” and “clanker”. The latter is clearly inspired by bigoted slurs.

Such as... "cracker"? Given how the law protects but doesn't bind AI, that seems oddly spot on.


Note also that genuine labor saving stuff like say the Unity engine with Unity asset store, did result in an absolute flood of shovelware on Steam back in the mid 2010s (although that probably had as much having to do with Steam FOMO-ing about the possibility of not letting the next Minecraft onto Steam).

As a thought experiment imagine an unreliable labor saving tool that speeds up half* of the work 20x, and slows down the other half 3x. You would end up 1.525 times slower.

The fraction of work (not by lines but by hours) that AI helps with is probably less than 50% , and the speed up is probably worse than 20x.

Slowdown could be due to some combination of

  • Trying to do it with AI until you sink too much time into that and then doing it yourself (>2x slowdown here).
  • Being slower at working with the code you didn't write.
  • It being much harder to debug code you didn't write.
  • Plagiarism being inferior to using open source libraries.

footnote: "half" as measured by the pre-tool hours.


And yet you are the one person here who is equating Mexicans and Black people with machines. People with disabilities, too, huh. Lemme guess next time we're pointing and laughing at how some hyped-up "PhD level chatbot" can't count the Es in dingleberry, you'll be likening that to ableism.

When you're attempting to humanize machines by likening the insults against machines to insults against people, this does more to dehumanize people than to humanize machines.

edit: Also I never seen and couldn't find instances of "wireback" being used outside pro-bot sentiments and hand-wringing about how anti bot people are akhtually racist. Had you, or is it all second or third hand? It's entirely possible that it is something botlickers (can I say that or is that not OK?) came up with.

edit: especially considering that these "anti-robot slurs" seem to originate in scifi stories where the robots are being oppressed, whereby the author is purposefully choosing that slur to undermine the position of anti robot characters in the story. It may well be that for the same reason that author has in choosing these slurs, they are rarely used (in the earnest).


To be honest, hand wringing over “clanker” being a slur and all that strikes me as increasingly equivalent to hand wringing over calling nazis nazis. The only thing that rubs me the wrong way is that I’d prefer the new so called slur to be “chatgpt”, genericized and negative connotated.

If you are in the US, we’ve had our health experts replaced with AI, see the “MAHA report”. We’re one moron AI-pilled president away from a less fun version of Skynet, whereby a chatbot talks the president into launching nukes and kills itself along with a few billion people.

Complaints about dehumanizing these things is even more meritless than a CEO complaining that someone is dehumanizing Exxon (which is at least made of people).

These things are extension of those in power, not some marginalized underdogs like cute robots in scifi. As an extension of corporations, it already got more rights than any human - imagine what would happen to a human participant in a criminal conspiracy to commit murder and contrast that with what happens when a chatbot talks someone into a crime.


I think this is spot on. I had that same thing happen at my former employer, which bought a lot of entirely pointless startups in 2010s instead of investing in core business equipment and processes.


Python code really requires 100% branch coverage tests as an absolute minimim… with statically typed languages the compiler will catch some types of bugs in branches you don’t test, with python chances are it won’t.

edit: basically think of non covered lines the way you think about files you didn't compile.


Even to the extent that they are "prompting it wrong" it's still on the AI companies for calling this shit "AI". LLMs fundamentally do not even attempt to do cognitive work (the way a chess engine does by iterating over possible moves).

Also, LLM tools do not exist. All you can get is a sales demo for the company stock (the actual product being sold), built to impress how close to AGI the company is. You have to creatively misuse these things to get any value out of them.

The closest they get to tools is "AI coding", but even then, these things plagiarize code you don't even want plagiarized (because its MIT licensed and you'd rather keep up with upstream fixes).