pyryt · on May 1, 2025

I would love to do this on my codebase after every commit

pyryt · on March 7, 2025

Some names are just too tempting https://arxiv.org/abs/1507.02672

pyryt · on Aug 22, 2024

This looks promising! Does any LLM understand InstantDB yet?

pyryt · on April 30, 2024

For comparison, the average manslaughter sentence in Finland seems to be around 9.5 years.

pyryt · on April 23, 2024

You can get the ball off the ground if you turn your phone upside down. Then you can just sort of fly over the places where you'd normally fall. Takes maybe a minute or two to complete all the levels.

pyryt · on Feb 9, 2024

I've enjoyed the classic sales books like SPIN Selling, The Challenger Sale, Fanatical Prospecting, and The Psychology of Selling.

pyryt · on Jan 29, 2024

Has anyone experimented with integrating real-time lipsync into a low-latency audio bot? I saw some demos with D-ID, but their pricing was closer to $1/minute, which makes it rather prohibitive.

pyryt · on Jan 29, 2024

Interesting project, thanks for sharing

pyryt · on Jan 29, 2024

Knowing when to speak is actually a prediction task in itself. See e.g. https://arxiv.org/abs/2010.10874

It would indeed be great to get something like this integrated with Whisper, an LLM, and TTS.

zachthewf · on Jan 29, 2024

Hard for me to imagine that this could be solved in text space. I think the prediction task needs to be done on the audio.

stiffler01 · on Jan 29, 2024

We thought about doing this in Whisper itself, since it's already working in the audio space.

stiffler01 · on Jan 29, 2024

Yes, this is something we want to look into in more detail. We really appreciate you sharing the research.

pyryt · on Dec 12, 2023

Do you have use case demo videos somewhere? It would be great to see this in action.

ralfelfving · on Dec 12, 2023

There's one at 00:30 in this YouTube video (timestamped the link): https://www.youtube.com/watch?v=1IdCWqTZLyA&t=32s