What open-source LLMs are you using in 2024?

Blaed@lemmy.world · 2 years ago

What open-source LLMs are you using in 2024?

xodoh74984@lemmy.world · edit-2 2 months ago

deleted by creator

Blaed@lemmy.world · 2 years ago

What sort of tokens per second are you seeing with your hardware? Mind sharing some notes on what you’re running there? Super curious!

Frozen_byte@sffa.community · 2 years ago

I would also be interested in Code-Pilot Models that are reaching for same performance like GitHub or Microsofts paid Models.

Currently I use TabbyML but the available Models are by far inferior.

xodoh74984@lemmy.world · 2 years ago

deleted by creator

Blaed@lemmy.world · 2 years ago

I was pleasantly surprised by many models of the Deepseek family. Verbose, but in a good way? At least that was my experience. Love to see it mentioned here.

🇨🇦Samuel Proulx🇨🇦@rblind.com · 2 years ago

Personally I find myself renting GPU and running Goliath 120b. Smaller models could do what I’m doing if I spent more time optimizing my prompts. But every day I’m doing different tasks, and Goliath 120b will just handle whatever I throw at it, no matter how sloppy I am. I’ve also been playing with LLAVA and Hermes vision models to describe images to me. However, when I really need alt-text for an image I can’t see, I still find myself resorting to GPT4; the open source options just aren’t as accurate or detailed.

wilkinsonwilfrid · edit-2 1 year ago

@slope game Several Deepseek models pleasantly surprised me. Flowing, yet with an air of elegance? I can say that from personal experience. Delighted to see it brought up here.