Hacker Newsnew | past | comments | ask | show | jobs | submit | ikurei's commentslogin

Qwen 3.7 Max: > During my local testing before the full eval harness it was the only non-GPT model that was able to complete the task, was not able to reproduce in the longer runs.

Doesn't that sound like may be the harness was the problem?


I was using the same harness for each run, the difference is from when I was running the harness locally on my machine before I pushed up the full runs.

It could just alias sudo on your ~/.bashrc. No need to replace the actual file on /usr/bin/sudo or wherever you have it. I would only need to be able to run arbitrary code as you.

I so wanted to love the Steam Deck, but it's a device with a 7 inch screen that occupies a massive volume on your bag. Unless you know you're going to play a fair ammount, it's not worth carrying around.

It's a fantastic console, but a mediocre general purpose computer.


I have one, and I love it! I don't use it as much as I thought I would, but it brings joy everytime. If you have a need for an extremely portable, not very powerful device on your life, this might be it.

I agree with the complaint about the trackpad, but the keyboard has been just fine for me. Just a bit small, of course. I also find the screen perfectly acceptable for what I use this thing for: youtube, taking notes, writing emails, small bouts of coding and ssh'ing into servers.

My main complaint is related to battery management. May be it's becaused I'm used to Macbooks, but it drives me nuts to go pick the Minibook up and find that it has no power, because I haven't used it in a couple of days and I put it to sleep. I haven't measured, but the power use on sleep is noticeable, and I suspect the leakage while hibernating might be significant too.

I don't really like the laptop form factor. Laptops are the perfect solution for only one use case: using them on your lap. On a table, I'd rather have the computer be just a tablet, to add a bluetooth keyboard and mouse. At my desk, with bigger screens, I'd like the computer to disappear into a small puck or box, like a Mac Mini. With the Minibook, being so small, the form factor makes sense again. It's so portable, so easy to take with me to a coffee shop or on a trip, it's worth it.

A tablet with a keyboard might be a more practical solution, although generally more expensive, but I appreciate that my Minibook runs Linux so well, so I don't have to even think about Apple or Google telling me how to use my computer.


I haven't heard about this, could you please share more info, some reference on that Claude Code intentional bug?


I'm not sure what the mechanism is, but I've definitely had Claude refuse to work on sessions that were touched by other models. Some kind of integrity check failure. Resetting the session back to the point before I used the other model fixed the problem.


IIRC Anthropic's API produces cryptographic signatures for thinking blocks. If you try to submit a set of messages that include thinking blocks with missing/invalid signatures, it'll refuse.

They do this to mitigate jailbreak attempts that rely on fabricated message history (e.g. making it look like the model was compliant in previous messages, increasing the likelihood that it'll continue to be compliant in future messages).



I had a couple of eye-opening conversations about this the last time I was in San Sebastian. Not everyone there loves Mondragon as much as we think, some see it as a closed club that makes it arbitrarily hard to get a job with them depending on your connections. I met some workers unhappy of their hiring practices and I think their starting working conditions. No idea if they were fair or just resentful.

I still admire Mondragon and wish there were more companies like it, but now I try to remind myself that most characterizations from the outside are surely lacking in nuance.


Yeah the problem with co-ops is that every new employee means reduced profit-sharing for existing employees. Ironically, companies in SV actually have a nice solution to this problem: Employees get equity, but newer employees get less than older employees. So from the perspective of old employees, it still makes financial sense to hire new employees, because the old employee equity becomes more valuable.

In technical econ terms, the marginal profit of new employees is typically below the average profit of existing employees. A profit-maximizing business only cares that the marginal profit is positive, and will hire until there is no additional profit to be made. A co-op is incentivized to keep average profit per employee high, which can mean reducing headcount in order to keep the average strong. So that's why co-ops can have a sort of exclusive club feel to them.

SV is actually an interesting example which proves how employee ownership can drive prosperity, but the typical co-op crowd doesn't want to talk about it because it's too capitalist-coded. In a way, SV companies show that employee ownership is not some sort of instant cure for everything which ails capitalism.


What is ”SV” in this context? Silicon Valley? (Not a US resident, and I’ve never heard of any software cooperatives from there.)


Yes, SV=Silicon Valley. Startups aren't co-ops (the term "co-op" implies employee governance for example). But employee compensation is heavily loaded towards equity or stock in the employing firm. If you're an early hire at a unicorn, you can make serious money off your ownership share in the company. It helps align the incentives of the employees and management. Press coverage of Silicon Valley doesn't usually mention it, but employee ownership is a huge part of the culture.


> 1 in every 10 people in public spaces.

1 in every 10 people may have a cat alergy, but the % of folks with an allergy as severe as yours has to be much lower. I know plenty of people with cat allergies who can spend entire evenings in my cat-inhabited with only very minor discomfort. The person with the most serious allergy to them I know is miles away from your symptoms.

I think you are exaggerating the severity of the issue, but I'm sorry you have this terrible allergy to something as common as cats, that sucks.


> I think you are exaggerating the severity of the issue

You and everyone else who doesn't suffer. But I was conservative by stating 10%. Medical literature says 10 - 20% and even qualifies that as a potential underestimate. I have looked for stats on severe sufferers, and they are unfortunately very difficult to find.

It does suck. But I would caution you not to discount the discomfort of others so easily.

People tend to understand that exposing someone with a peanut allergy to peanuts is dangerous and can even be considered assault or attempted murder.

No one thinks that about cats.

But the severity of the allergic response occupies the same spectrum (same immune system, misbehaving in the same way). Peanuts just aren't as cute or fluffy as cats. No one is offended if you don't want to pet their peanut. No one makes you eat peanuts in order to visit them at home. No matter how mild the peanut allergy. No one rubs peanuts into every surface of a place like cats spread Fel D 1.

But immune systems don't know the difference. An allergen is an allergen.

To folks who have the allergy, the differences in the way it's treated compared to others affect our every day.


Less than 0.5% of people are at risk of anaphylaxis from cat allergies. Since you brought up peanut allergies, it's relevant to point out that we haven't banned peanuts. It sucks that you and others suffer, but getting rid of cats doesn't make sense when you can ask if there are cats around, much like people with peanut allergies ask about the presence of peanuts.


So that's 1 in 200 at mortal risk. Roughly 1,744,000 people in the US.

1 in 5 to 10 in discomfort. Roughly 69,760,000 people in the US.

Good to know. Given Dunbar's number it's likely that most people in the US know someone with a severe cat allergy.


Cat owners have significantly lower cardiovascular deaths. Children growing up with a cat have an almost 50% lower development of asthma and allergies. They reduce stress and depression.

https://pmc.ncbi.nlm.nih.gov/articles/PMC3317329/


I think you'll find that's as true for animals who don't shed proteins evolved to elicit severe allergic reactions as it is for those that do.

Your reference begins with: "The presence of pets has been associated with reduction of stress and blood pressure and therefore may reduce the risk of cardiovascular diseases."


Although I agree with other commenters that your command can't compare to all of bat's features, many of which I appreciate... thank you for sharing this tip, I didn't know about `highlight` and I can't install `bat` at work.

This will live in my .bashrc for a long time:

    cat() {
      if [[ -t 1 ]]; then
        command cat "$@" | highlight --force -O xterm256
      else
        # plain cat to pipe into other things
        command cat "$@"
      fi
    }


This... doesn't work? Everything just comes out green. It's not clear to me how 'highlight' could even possibly know what syntax it's supposed to be highlighting when processing stdin, unless it ingests the whole thing until EOF and then applies some kind of fuzzy logic. If you feed it a filename as an argument, it just checks the extension.


curious, how did you have highlight installed?

i'd change the third line so you can actually get syntax highlighting:

    command highlight --stdout --force -O xterm256 "$@"


Webapps are rewritten because a developer wanted to use the new shiny, or someone was convinced that everything will be better with the newer frameworks everyone is using. Also, it often goes hand in hand with giving it a more modern look-and-feel.

But the point is not whether webapps are rewritten, but whether they have to be rewritten. I know some old enterprise webapps made with PHP about 10 years ago that are still working fine.

You do have to worry about security issues, and the occasional deprecation of an API, but there is no reason why a web-based service should need to be rewritten just to keep working. Is that true for mobile and desktop apps?


If your webapp is simple a rewrite is no big deal and often cheaper than updating the old. As your project gets large that is no longer true. I work with embedded systems, when everything was small ( 8 bits isn't enough for anything else - new feature often means removing something else) we of rewrote large parts to get one new feature. it was easy to estimate a new project and we came in on time. As projects get bigger (32 and 64 bits are now available) we can't do that we can't afford a billion dollar project to rewrite every year.


>but there is no reason why a web-based service should need to be rewritten just to keep working

I mean most webapps of any size are built on underlying libraries, and sometimes those libraries disappear requiring a significant amount of effort to port to a new library.


It's not that you _need_ sympathy, or that football deserves or needs your sympathy like it's a good cause.

It's just generally good to try to understand others instead of distancing yourself from them. I find F1, jazz, finance, and so many other things to be really boring and uninteresting, but I try to get the people who like those and connect with them. F1 people and jazz people are often more interesting than their interests; I haven't gotten there with finance yet. The world is more interesting this way, but you're under no obligation.

> In a just world LaLiga would get sued into the ground for disabling a public utility on a level equivallent to an international cyberattack.

In a just world LaLiga and FIFA would've been sued into the ground like five scandals ago, but I don't think gtowey was suggesting you try to empathise with them, but with people who like football.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: