Hacker Newsnew | past | comments | ask | show | jobs | submit | Robin_f's commentslogin

Anthropic mostly had an advantage in speed. It feels like with a 25% increase in speed with Codex 5.3, they are now losing that advantage as well.


I just asked Opus 4.6 to debug a bug in my current changes and it went for 20 minutes before I interrupted it. Take that as you will.


Doesn't feel like a useful data point without more context. For some hard bugs I'd be thrilled to wait 30 minutes for a fix, for a trivial CSS fix not so much. I've spent weeks+ of my career fix single bugs. Context is everything.


Sure, but I've never experienced a 20 minute wait with CC before. It was an architectural question but it would have taken a couple minutes with a definitive answer on 4.5.


> I've spent weeks+ of my career fix single bugs.

Same, same. It's not a useful data point at all.

bug: llm alignment

timeframe to fix : probably never


As they point out in their blog post, 4.6 intentionally thinks longer, but you can adjust it from the CLI.


Similar thing happened to me 3 months ago. To this day no response to any appeals. I've actually started a GDPR request to see why I got banned, which they're stretching out as long as possible (to the latest possible deadline) so far.


I usually use Screaming Frog SEO Spider to find the broken links. Works very well!


That ain't really free plus takes longer time. I have a blog that has over 25000 posts. That's why I wrote my own. Felt like share with others.. :)


Tried to play around with some JSX but unfortunately it keeps throwing the error that React is not defined.

Looks cool though!


The author published a working React & JSX example here that might be useful: https://natto.dev/@paul/1dcbeb7583624bf19d69b9942271a703.

Some background on Twitter too: https://twitter.com/_paulshen/status/1380292393291841541


Would like to try it when it comes out on Android :)


We're planning to start working on it in January, and I think we can get the first version out in late February. If you know good senior Android engineers, send them my way! (charles@bunch.ai, no CV required, let's just chat)


The first link to the PDF unfortunately doesn't work.


I've worked for a company that released a product exactly like this. They went bankrupt 6 months later.

I hope you'll have more success, but at this moment I cannot see anything in there that makes it better than the product that the company I worked for was working on..


Great, another release not supported by Spectron so you can't actually write tests for it.


This post seems very focused on projects with just a single developer. The "recognizing and tagging" versions for example completely breaks down when you have multiple people commit and merging things to master and multiple pipelines running that are trying to recognize and tag versions.


The post is very much not complete -- there is much more a team would have to standardize to fully take a set up like this into production, bu I'm not sure that it "completely breaks down" with more people contributing.

As long as there is some release coordination, a system like this can work. In particular, I've found that the lightest way to get a system like this working for x > 1 developers is to have a release-vX.X.X branch for releases that are going out and vX.X.X tags for releases once they have actually landed.

You mention the recognizing and tagging versions being an issue -- are you imagining a world where two people are releasing something at the same time? I'm not exactly sure

If we assume that what that posts suggests isn't feasible, regardless of how you deploy and do your CI, if your release process can be done by a human, it can likely be automated. Gitlab CI is most robust and yet easiest to understand automated CI system I've seen for making that happen (at least that was my intended point).


This is what interests me a lot.

I have found that there are moments that start to give chaotic behavior in CI that are influenced by the number of people working on mono repository and some other stuff like code maturity: 1 dev, 2-5 devs, 5-10 devs and 10+ devs .....

Something that works great for any number might start to break for other number. Something effective for huge team may be very bad for smaller team. Something that works great in one moment (first several months of the project when thing still get shape 0.X.Y versions) sux when we move to 1.x version and vice versa.

I am not aware of anybody giving more detailed thought about this nor examine what could be most optimal solution for any type of team (if such thing exist) or on what precise moment you start doing certain practices (for example protecting master branch).


Really cool. Would be nice if it had a format option in the editor however. Copy pasting CSS around was becoming a bit annoying because I have to format it manually.


Thanks :-) By "format", do you mean something like Prettier? I'm curious what tool you'd use for CSS?


If you mean indentation, shift-tab should auto-indent the selection


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: