Little-known Homebrew tip: Python applications are installed in their own Homebrew-managed virtual environment, which means you can run pip in them to upgrade to the latest version of packages even when the tap hasn't been updated yet.
I just tried this and it worked - now "vd" gives my Visidata 2.0 on my system:
My main use case for this is: I get some random data file thrown at me. Could be json, csv, xlsx anything that has data that can somehow be interpreted somewhat tabular and I want to find out quickly what's going on.
You can really go from not knowing what's in a file to a decent understanding in less than a minute.
Here's a quick example from today of why I find it such a general purpose tool
Pipe in some fixed-width formatted data
kubectl get pods | vd -f fixed -
Gives me a searchable, usable output. Press right to highlight the status column, shift-f and I get a count (+histogram) of the statuses. Three have failed. Hit enter on that row and I can see specifically which ones have failed. This takes a few seconds (I just tried and did it under 5 from running the above command).
If I knew I wanted the failed ones I could have grepped or similar but I can then order by most restarts, etc.
This is a tiny part of the functionality but it's an example of "there's some tabular-ish data but what's there?". Load X, histogram, filter, sort is probably 90% of my use-cases.
Recently I have been working a lot with semantic sentence encoding. For example for some semantic search use cases or semantic clustering.
As a sort of demo and way to explore the potential and limitations of semantic encoding, I thought it would be fun to build this small side project together with a colleague.
It's basically a HN clone based of VueJS' HN clone with the small twist of having the top 3 semantically most similar together with the current HN stories.
I have been using this for some time for myself now and I find sometimes some fun stories come up. It doesn't always work so great, but see for yourself.
I added a feature for ranking how similar you perceive the story to be. If enough people make use of the ranking feature it could be possible that a dataset of similar HN-stories title pairs would come together (I'm thinking of sth. like the GLUE STS-B dataset).
If that was the case I would, of course, make it available publicly.
Thanks for reading and please let me know your thoughts :)