I used to watch commercials during my starry-eyed teenage years as well. I used to think ninja turtles was anything but a 20 minute ad for toys when I was in preschool.
Ads are actually horrible. Holy moly I'm so glad I can just ublock my way through the sea of garbage that is the internet nowadays. The fact there are people still indoctrinated by commercials makes sense, I'm just sorry you see that as a marker of maturity.
If it's so clear, then your implication is strange. Can you not imply the answer you're looking for from this clarity? It's clear that you want an answer to a leading question instead of the truth.
I think the problem with this line of reasoning is that it's one-sided. Essentially you are saying "Just trust me bro" on behalf of a self-evaluating company.
I'd argue the potential for abuse is a perfectly reasonable discussion to have, and doesn't have much bearing on the effectiveness of anticheat, but I understand that's not the point you are trying to make.
Sorry, my writing should have been clearer, I put one too many negatives in. :-)
I didn't claim we should trust the company. Whether we can trust the anticheat maker is certainly part of the rigorous evaluation of the tradeoffs I mentioned. My point was that saying "it doesn't stop cheaters" is both incorrect and stifling to a more productive conversation, because it implies anticheat has no value and is therefore worth no risk.
As for me, if Gabe said "now you can opt your Steam Deck in to a trusted kernel we ship with anticheat and play PUBG," I'd probably do it. But that's because I, for better or worse, tend to trust Gabe. If Tencent were shipping it, I'd probably feel differently.
Compare: "I still get spam, therefore all these anti-spam measures are worthless"
It is absolutely the case that there would be more cheating if we turned off the only partially effective systems. We know this because they are regularly stopping and banning people!
People are going to to be upset when it happens but it is absolutely inevitable at some point Steam ships a Steam Deck with hardware based attestation of the OS being a signed version of SteamOS, feeding back to a Steam API, that can be used as the basis of an anti-cheat solution.
Making an account just to point out how these comments are far more exhausting, because they don't engage with the subject matter. They are just agreeing with a headline and saying, "See?"
You say, "explaining away the increasing performance" as though that was a good faith representation of arguments made against LLMs, or even this specific article. Questionong the self-congragulatory nature of these businesses is perfectly reasonable.
The level of proof for the latter is much higher, and IMO, OpenAI hasn't met the bar yet.
Something really funky is going on with newer AI models and benchmarks, versus how they perform subjectively when I use them for my use-cases. I say this across the board[1], not just regarding IpenAI. I don't know if frontier labs have run into Goodheart's law viz benchmarks, or if my use-cases that are atypical.
1. I first noticed this with Claud 3.5 vs Claud 3.7
That's a fair question, and I agree. I just find it odd how we shout across the aisle, whether in favor or against. It's a case of thinking the tech is neat, while cringing at all the money-people and their ideations.
The irony of telling someone not to be rude while being absolutely insufferable. Peak redditor behavior.
Please provide examples. Thank you!