Skip to content

Releases: huggingface/text-generation-inference

v3.3.6

17 Sep 00:48
efb94e0

Choose a tag to compare

What's Changed

  • Add missing backslash by @philsupertramp in #3311
  • Revert "feat: bump flake including transformers and huggingface_hub versions" by @drbh in #3323
  • fix: remove azure by @drbh in #3325
  • Fix mask passed to flashinfer by @danieldk in #3324
  • Update iframe sources for streaming demo by @coyotte508 in #3327
  • Revert "Revert "feat: bump flake including transformers and huggingfa… by @drbh in #3326
  • Revert "feat: bump flake including transformers and huggingface_hub versions" by @drbh in #3330
  • Patch version 3.3.6 by @tengomucho in #3329

New Contributors

Full Changelog: v3.3.5...v3.3.6

v3.3.5

02 Sep 15:02

Choose a tag to compare

What's Changed

Full Changelog: v3.3.4...git

v3.3.4

19 Jun 10:00

Choose a tag to compare

Fix for Neuron models exported with batch_size 1.

What's Changed

  • [gaudi] gemma3 text and vlm model intial support. need to add sliding window … by @sywangyi in #3270
  • Neuron backend fix by @dacorvo in #3273

Full Changelog: v3.3.3...v3.3.4

v3.3.3

18 Jun 13:11

Choose a tag to compare

Neuron backend update.

What's Changed

Full Changelog: v3.3.2...v3.3.3

v3.3.2

30 May 14:20

Choose a tag to compare

Gaudi improvements.

What's Changed

Full Changelog: v3.3.1...v3.3.2

v3.3.1

22 May 07:49

Choose a tag to compare

This release updates TGI to Torch 2.7 and CUDA 12.8.

What's Changed

New Contributors

Full Changelog: v3.3.0...v3.3.1

v3.3.0

09 May 13:57

Choose a tag to compare

Notable changes

  • Prefill chunking for VLMs.

What's Changed

New Contributors

Full Changelog: v3.2.3...v3.3.0

v3.2.3

08 Apr 08:18
a1f3ebe

Choose a tag to compare

Main changes

  • Patching Llama 4

What's Changed

Full Changelog: v3.2.2...v3.2.3

v3.2.2

06 Apr 09:41
c67546f

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v3.2.1...v3.2.2

v3.2.1

18 Mar 14:28
4d28897

Choose a tag to compare

What's Changed

Full Changelog: v3.2.0...v3.2.1