Hacker News | okanesen's comments

Not that dramatic of an increase actually - the M2 Max already had 400GB/s and the M2 Ultra 800GB/s of memory bandwidth, so the M3 Ultra's 819GB/s is just a modest bump. Though the M4 Max's additional 146GB/s is indeed a more noticeable improvement.


I should also note that 800/819GB/s of memory bandwidth is actually VERY usable for LLMs. Consider that a 4090 is just a hair above 1000GB/s.


Does it work like that though at this larger scale? 512GB of VRAM would be across multiple NVIDIA cards, so the bandwidth and access is parallelized.

But here it looks like more of a bottleneck, from my (admittedly naive) understanding.


For inference the bandwidth is generally not parallelized, because each token has to pass through the model layer by layer. The most common model splitting method assigns each GPU a subset of the LLM layers, and it doesn't take much bandwidth to send the intermediate activations via PCIe to the next GPU.
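
To make the layer-wise split above concrete, here is a minimal sketch of assigning contiguous blocks of layers to GPUs. The function name and the 80-layer / 4-GPU figures are made up for illustration; real inference frameworks may balance the split differently (for example by memory per device).

```python
def split_layers(num_layers: int, num_gpus: int) -> list[range]:
    """Assign contiguous blocks of layers to GPUs as evenly as possible."""
    base, extra = divmod(num_layers, num_gpus)
    assignments = []
    start = 0
    for gpu in range(num_gpus):
        # The first `extra` GPUs take one additional layer each.
        size = base + (1 if gpu < extra else 0)
        assignments.append(range(start, start + size))
        start += size
    return assignments


# Hypothetical example: an 80-layer model on 4 GPUs.
for gpu, layers in enumerate(split_layers(80, 4)):
    print(f"GPU {gpu}: layers {layers.start}-{layers.stop - 1}")
```

Each GPU only ever reads its own block of layers from its own memory, which is why the per-GPU memory bandwidth, not the link between GPUs, tends to be the number that matters here.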


My understanding is that the GPU must still load its assigned layer from VRAM into registers and L2 cache for every token, because those aren't large enough to hold a significant portion. So naively, for a 24GB layer, you'd need to move up to 24GB for every token.
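
As a rough back-of-the-envelope check of that reasoning: if every token requires reading all the weights once, memory bandwidth divided by the bytes read per token gives a ceiling on tokens per second. This is a naive sketch that ignores compute, caching, and batching; the function name is made up, and the bandwidth figures are the approximate ones quoted in this thread.

```python
def max_tokens_per_second(bandwidth_gb_s: float, gb_read_per_token: float) -> float:
    """Naive upper bound: one full pass over the weights per generated token."""
    return bandwidth_gb_s / gb_read_per_token


# 24 GB of weights read per token, as in the example above.
GB_PER_TOKEN = 24

# Approximate bandwidths mentioned in this thread (GB/s).
for name, bandwidth in [("M3 Ultra", 819), ("RTX 4090", 1000)]:
    limit = max_tokens_per_second(bandwidth, GB_PER_TOKEN)
    print(f"{name}: at most ~{limit:.1f} tokens/s")
# M3 Ultra: at most ~34.1 tokens/s
# RTX 4090: at most ~41.7 tokens/s
```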


But the memory bandwidth is only part of the equation; the 4090 is at least several times faster at compute compared to the fastest Apple CPU/GPU.


The codes are all gone. That was fast.

E: Happy to take one, if somebody happens to have a spare one left. Email is in my bio.


Even though I'm using Container Proxy myself, you can almost do the same thing with the official plugin (Manage Containers -> Click any Container -> Advanced proxy settings).


I use Containerise[0] (not affiliated) exactly for that purpose but I agree, this should really be included in the official plugin.

[0]: https://github.com/kintesh/containerise


Probably not exactly what you want but for proxies you can use Container proxy[0] (not affiliated), which allows you to assign different proxies added by yourself to specific containers.

[0]: https://addons.mozilla.org/en-US/firefox/addon/container-pro...


I've used containers with container proxy for well over a year and it works great for segregating work traffic on my home system. I use a container for work traffic that is assigned to an SSH session to my work system that exposes a SOCKS proxy that I make sure is running.

Eventually I moved work browsing to a different browser that I set to run through that proxy explicitly, but I still have the Firefox config set up in case I don't want to launch that, or sometimes don't care to copy and paste into the work browser. (I wish I could find a way to make links launched from Outlook/Teams open in Chrome but other links open in Firefox.)


If you are on a Mac, have a look at finicky to manage which URLs should open in a specific browser.

https://github.com/johnste/finicky


I'm not, but just yesterday or the day before I was thinking maybe I'd write a simple application that takes the URL and decides what to do with it based on whatever data I decide is relevant (time of day, domain, etc). If I had access to what the source application was, that would be ideal, but I doubt Windows 10 supplies that.
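
A minimal sketch of what the decision part of such an application could look like, using only the URL's domain and the time of day. The domain list, the working hours, and the browser names are all hypothetical placeholders; registering the program as the default browser and actually launching the chosen one are left out.

```python
from datetime import time
from urllib.parse import urlparse

# Hypothetical rules: adjust domains and hours to taste.
WORK_DOMAINS = {"teams.microsoft.com", "outlook.office.com"}
WORK_HOURS = (time(9, 0), time(17, 0))


def choose_browser(url: str, now: time) -> str:
    """Return the name of the browser that should open `url`."""
    host = (urlparse(url).hostname or "").lower()
    # Match the domain itself or any of its subdomains.
    is_work_domain = any(host == d or host.endswith("." + d) for d in WORK_DOMAINS)
    in_work_hours = WORK_HOURS[0] <= now < WORK_HOURS[1]
    if is_work_domain and in_work_hours:
        return "chrome"
    return "firefox"


print(choose_browser("https://teams.microsoft.com/l/meetup", time(10, 0)))
print(choose_browser("https://news.ycombinator.com/", time(10, 0)))
```

Keeping the decision in a pure function like this makes the rules easy to test separately from whatever mechanism ends up handing URLs to the program.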


Minor correction: https://webirc.hackint.org/#irc://irc.hackint.org/noanswers

Without the %23 prefix.


No, not necessarily.

There is a dedicated page under examples, if you're interested: https://codemirror.net/6/examples/bundle/.


Correct, more info in the FAQ[0]:

> What are Ask HN and Show HN?

> Ask HN lists questions and other text submissions. Show HN[1] is for sharing your personal work and has special rules[2].

[0]: https://news.ycombinator.com/newsfaq.html

[1]: https://news.ycombinator.com/show

[2]: https://news.ycombinator.com/showhn.html


Just as a side note: unfortunately this won't work in Germany, since that feature was disabled there.


Wouldn't uninstall, switch to the US app store, and reinstall do the trick?


Why?



needs to be reposted and made a dupe 39 more times, brb

