LM Studio has quickly become the best way to run local LLMs on an Apple Silicon Mac: no offense to vLLM, Ollama, and other terminal-based approaches, but LLMs have many levers for tweaking output, and sometimes you need a UI to manage them. Now that LM Studio supports MLX models, it's one of the most efficient, too.
I'm not bullish on MCP, but at the least this approach gives a good way to experiment with it for free.
This isn't true. You can `ollama run {model}`, then `/set parameter num_ctx {ctx}`, then `/save`. It's recommended to `/save {model}:{ctx}` so the setting persists across model updates.
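Concretely, the session looks something like this (the model name and context size here are placeholders, not recommendations):

```
$ ollama run llama3.1
>>> /set parameter num_ctx 16384
>>> /save llama3.1:16k
>>> /bye
```

Saving under a new tag (`llama3.1:16k`) means a later `ollama pull llama3.1` won't clobber your customized context length.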
As of two weeks ago, if I did this it would reset the moment Cline made an API call, but LM Studio would work correctly. I'll have to try again. I even confirmed Cline was not overriding the num_ctx setting.
I just wish they'd give the UI a facelift. Right now it's too colorful for me, with many different shades of similar colors. I wish they'd copy a color palette from Google AI Studio, Trae, or PyCharm.
As near as I can tell, it's supposed to make calling other people's tools easier. But I don't want to spin up an entire server to invoke a calculator. So far it seems to make building my own local tools harder, unless there's some guidebook I'm missing.
You're not spinning up a whole server, lol. Most MCPs can be run locally and talked to over stdio; they're just apps the LLM can call, and what they talk to or do is up to the MCP author. It's easier to have an MCP that communicates what it can do and handles the back-and-forth than to write non-standard middleware for, say, calls to an API, or for AppleScript, or VMware, or something else...
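To make the "just an app talked to over stdio" point concrete, here's a stripped-down sketch. The method names (`tools/list`, `tools/call`) and the response shape follow the MCP spec, but the `TOOLS` dict and `handle_line` function are illustrative, not a real SDK; an actual server would use the official MCP SDK rather than hand-rolling JSON-RPC like this.

```python
import json

# Hypothetical sketch: an stdio MCP server is just a process that reads
# JSON-RPC requests line by line on stdin and writes responses to stdout.
TOOLS = {
    "add": lambda a, b: a + b,  # the "calculator" from the comment above
}

def handle_line(line: str) -> str:
    req = json.loads(line)
    if req["method"] == "tools/list":
        # Advertise what this server can do.
        result = {"tools": [{"name": name} for name in TOOLS]}
    elif req["method"] == "tools/call":
        # Dispatch to a plain local function.
        tool = TOOLS[req["params"]["name"]]
        value = tool(**req["params"]["arguments"])
        result = {"content": [{"type": "text", "text": str(value)}]}
    else:
        result = {}
    return json.dumps({"jsonrpc": "2.0", "id": req["id"], "result": result})

# A real server would loop over stdin, which the host (LM Studio, an IDE
# plugin, etc.) owns:
#   for line in sys.stdin:
#       print(handle_line(line), flush=True)
```

There's no HTTP server anywhere in that picture; the host process spawns the tool and pipes messages to it.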
I wish the documentation was clearer on that point; I went looking through their site and didn't see any examples that weren't oversimplified REST API calls. I imagine they might have updated it since then, or I missed something.
It's a protocol that doesn't dictate how you are calling the tool. You can use in-memory transport without needing to spin up a server. Your tool can just be a function, but with the flexibility of serving to other clients.
Are there any examples of that? All the documentation I saw seemed to be about building an MCP server, with very little about connecting an existing inference infrastructure to local functions.
If you go to huggingface.co, you can tell it what specs you have and when you go to a model, it'll show you what variations of that model are likely to run well.
So if you go to this[0] random model, on the right there is a list of quantizations by bit width, and the ones you can run will be shown in green.
LM Studio's model search is pretty good at showing what models will fit in your VRAM.
For my 16gb of VRAM, those models do not include anything that's good at coding, even when I provide the API documents via PDF upload (another thing that LM Studio makes easy).
So, not really, but LM Studio at least makes it easier to find that out.