Hey all, I'm from Pinecone (shocker). Addressing common questions...
What are vector DBs used for? > Storing and searching through embeddings at scale, which are created and consumed by LLMs and other AI models for applications like semantic search and chatbots (e.g., to avoid hallucinations).
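The core retrieval step behind semantic search can be sketched in a few lines: embed documents and queries as vectors, then rank stored vectors by similarity. Here's a minimal toy sketch — the 3-dimensional vectors and document names are made up for illustration (real embedding models produce hundreds to thousands of dimensions):

```python
import math

# Toy 3-dimensional "embeddings" -- real models produce e.g. 1536-dimension
# vectors, but the retrieval logic is identical. All names/vectors here
# are hypothetical.
docs = {
    "doc-cats":   [0.9, 0.1, 0.0],
    "doc-dogs":   [0.1, 0.9, 0.0],
    "doc-stocks": [0.0, 0.1, 0.9],
}

def cosine(a, b):
    # Cosine similarity: dot product normalized by vector magnitudes.
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(x * x for x in b))
    return dot / norm

def search(query_vec, k=1):
    # Rank every stored embedding by similarity to the query vector.
    ranked = sorted(docs, key=lambda d: cosine(query_vec, docs[d]), reverse=True)
    return ranked[:k]

print(search([0.8, 0.2, 0.1]))  # → ['doc-cats']
```

A vector DB does essentially this, but over millions or billions of vectors with approximate indexes instead of an exhaustive scan.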
Why use a managed vector DB like Pinecone instead of [Faiss, pgvector, self-hosted thing, numpy.array, etc]? > Usually comes down to scale and convenience. If you're dealing with a small number of embeddings, say anything less than 10M, you're probably fine just reaching for the closest and most convenient option. (We try to make Pinecone that convenient option, and our free plan holds up to ~100k 1536-dimension embeddings.) If you're dealing with larger scale -- say hundreds of millions to billions of embeddings -- and have strict performance requirements, and aren't thrilled by the thought of managing your own vector database the way we are, then you should consider Pinecone. It turns out there's a sufficiently large population that falls into the latter category, just as with any other database category.
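For a sense of what that ~100k-embedding free tier means in raw data terms, here's a back-of-envelope calculation (assuming 4-byte float32 components; actual index overhead will add to this):

```python
# Rough storage for 100k embeddings of 1536 dimensions each,
# assuming float32 (4 bytes per component). Index structures and
# metadata add overhead on top of this raw figure.
num_vectors = 100_000
dims = 1536
bytes_per_float = 4

raw_bytes = num_vectors * dims * bytes_per_float
print(f"{raw_bytes / 1024**2:.0f} MiB")  # → 586 MiB
```

So the free tier holds roughly half a gigabyte of raw vector data; the 10M mark mentioned above would be ~57 GiB, which is where managing it yourself starts to get less fun.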
Allow me one more plug: We're hosting a webinar next week about testing Pinecone performance with your own data and performance requirements. I have a feeling lots of folks reading this would find that useful. → https://pinecone-io.zoom.us/webinar/register/WN_z9JqLjLGTyu4...
How are you guys thinking about the embedding generation side of things? It seems like that part has a generally hefty compute cost before anything even gets into the index. I just open-sourced a Swift package to try to make that part as easy as possible; the example project exports directly to Pinecone. https://github.com/ZachNagengast/similarity-search-kit
That’s exact nearest-neighbour search, where each query cost scales linearly with the number of vectors in your DB; what these DBs (and libraries like FAISS) use is approximate nearest neighbour (ANN), which makes the search much, much faster.
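To make the cost concrete: an exact search has to compute a distance against every stored vector, so per-query work is O(n·d) for n vectors of dimension d. A minimal sketch of that brute-force scan (toy data; this is what ANN indexes like IVF or HNSW avoid by pruning the search space):

```python
# Exact (brute-force) nearest neighbour: one distance computation per
# stored vector, so each query costs O(n * d). ANN indexes speed this up
# by not touching all n vectors.
def squared_l2(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def exact_nn(query, vectors):
    # Touches every stored vector exactly once -- the O(n * d) scan.
    best_id, best_dist = None, float("inf")
    for vec_id, vec in vectors.items():
        d = squared_l2(query, vec)
        if d < best_dist:
            best_id, best_dist = vec_id, d
    return best_id

toy = {"a": [0.0, 0.0], "b": [1.0, 1.0], "c": [0.2, 0.1]}
print(exact_nn([0.15, 0.1], toy))  # → 'c'
```

At billions of vectors even a fast linear scan per query becomes prohibitive, which is why approximate indexes trade a little recall for orders-of-magnitude speedups.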
If you're new to this, the best place to "see for yourself" is our free plan (https://app.pinecone.io) and collection of examples (https://docs.pinecone.io/docs/examples).