I think it's a good idea to experiment with and discover the limitations of small, untuned models before exposing yourself to the modern very powerful ones. It gives you a better sense of their nature as token predictors and not real sentience.
In the same way, seeing an incompetent stage magician fumble before you see a very talented one perform flawlessly will help you understand that it's all sleight of hand. If you jump right to a professional performance, you might think it's real magic.
Agreed, and you could get a completely unrelated LLM to generate a similar apology without any of the real context, it would make up reasoning just as effectively.
Yes, this isn’t a logical error at all. If you don’t have taste in one area — actually, it’s even worse, you’re not even aware of your own lack of taste — why would I trust your taste in another area?
This is assuming that the owner is also the chef, and exclusively concerned with cooking. Being a restaurateur is a multi-disciplinary job. The owner's job is literally to have good taste in all areas of the restaurant business: food, interior design, hospitality, branding and marketing, etc. No one is saying that they have to be a graphic designer. The obvious answer here was to have the good sense to hire a local college art student for like $300 to make an endearing and meaningful logo.
Probably more likely to run into a Shoggoth than Cthulhu in the Antarctic, or maybe an Elder Thing if any are still around. Rl'yeh is a bit further north.
Regarding the interactions shown in the screenshots:
LLMs are pattern-matching machines. They keep the pattern going. Once "the agent disobeys the human's instructions" has made its way into the context, that is the pattern that it's going to keep matching. No amount of telling it to stop will make it stop.
The only possible solution is excising it from context and replacing it with examples of it doing the right thing. Given that these models have massive context windows now and much of the output is hidden from the user, that's becoming less viable.
It makes the behavior more obvious from simply looking at the file, for one thing, and it means you can just lump it into your next `git add -A` without needing to handle it specially.
I've never been a fan of dual grid, and personally prefer the rpg maker approach of using 5 sample tiles and then chopping up and recombining them to make the 47 tiles needed for what I believe is called "blob tiling".
Yup, email is usually the preferred communication tool of record. In a previous job, our messages on Teams were wiped after 8 days so anything that needed to be recorded had to be in an email or some form of document.
In the same way, seeing an incompetent stage magician fumble before you see a very talented one perform flawlessly will help you understand that it's all sleight of hand. If you jump right to a professional performance, you might think it's real magic.
reply