Gemma SAM MLX

Local vision-guided segmentation on Apple Silicon — Gemma 4 decides what to look for, SAM 3.1 segments it. No cloud, no API.

Reproduces Maziyar Panahi's demo with a native macOS app.

Architecture

┌──────────────────┐                 ┌──────────────────┐
│   SwiftUI App    │◄───────────────►│ FastAPI Backend  │
│  (macOS native)  │ localhost:8199  │   (Python/MLX)   │
└──────────────────┘                 └─────────┬────────┘
                                               │
                                     ┌─────────┴─────────┐
                                     │                   │
                              ┌─────────────┐   ┌────────────────┐
                              │   Gemma 4   │   │    SAM 3.1     │
                              │ (reasoning) │   │ (segmentation) │
                              └─────────────┘   └────────────────┘

Models (auto-downloaded on first run):

Gemma 4: mlx-community/gemma-4-26b-a4b-it-4bit (15.6 GB, MoE 26B/4B active)
SAM 3.1: mlx-community/sam3.1-bf16 (3.49 GB, ~873M params)

Quick Start

# 1. Install Python deps
uv venv && source .venv/bin/activate
uv sync

# 2. Start backend + launch macOS app
./start.sh app

# Or just the backend (for curl/API use)
./start.sh backend
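
Because the models are downloaded on first run, the backend may take a while to start answering. The sketch below polls `/api/status` (listed under API Endpoints) until a request succeeds. It only checks that the server responds. The response schema is not documented in this README, so the body is returned as raw bytes, and `wait_for_backend` and `fetch_status` are illustrative names, not part of the project.

```python
import time
import urllib.request

# Port and path as given in this README.
STATUS_URL = "http://localhost:8199/api/status"


def fetch_status(url=STATUS_URL, timeout=5.0):
    """Return the raw body of GET /api/status; raises OSError if the server is down."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return resp.read()


def wait_for_backend(fetch=fetch_status, attempts=60, delay=2.0, sleep=time.sleep):
    """Call fetch() until it stops raising; return its result, or None if it never succeeds."""
    for _ in range(attempts):
        try:
            return fetch()
        except OSError:  # URLError, connection refused, and timeouts all derive from OSError
            sleep(delay)
    return None
```

Calling `wait_for_backend()` after `./start.sh backend` returns the status body once the server is reachable. Whether a successful response also means both models have finished loading is an assumption to verify against the actual payload.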

API Endpoints

Endpoint	Method	Description
`/api/status`	GET	Model loading status
`/api/segment`	POST	Segment image with text prompt
`/api/analyze`	POST	Gemma 4 image analysis
`/api/analyze-and-segment`	POST	Auto-analyze then segment

# Example: segment all vehicles
curl -F "image=@photo.jpg" -F "prompt=all vehicles" http://localhost:8199/api/segment
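
For scripted use, the same request can be sent from Python with only the standard library. This is a minimal sketch mirroring the curl call above: the form field names `image` and `prompt` come from that example, while the helper names `build_multipart` and `segment` are hypothetical. The response format is not described in this README, so the body is returned unparsed.

```python
import mimetypes
import os
import urllib.request
import uuid

# Port and path as given in this README.
SEGMENT_URL = "http://localhost:8199/api/segment"


def build_multipart(prompt, filename, image_bytes):
    """Encode the `prompt` and `image` form fields as multipart/form-data."""
    boundary = uuid.uuid4().hex
    mime = mimetypes.guess_type(filename)[0] or "application/octet-stream"
    head = (
        f"--{boundary}\r\n"
        'Content-Disposition: form-data; name="prompt"\r\n\r\n'
        f"{prompt}\r\n"
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="image"; filename="{filename}"\r\n'
        f"Content-Type: {mime}\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return head + image_bytes + tail, f"multipart/form-data; boundary={boundary}"


def segment(image_path, prompt, url=SEGMENT_URL, timeout=300.0):
    """POST an image and a text prompt; return the raw response body."""
    with open(image_path, "rb") as f:
        image_bytes = f.read()
    body, content_type = build_multipart(prompt, os.path.basename(image_path), image_bytes)
    req = urllib.request.Request(
        url, data=body, headers={"Content-Type": content_type}, method="POST"
    )
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return resp.read()
```

With the backend running, `segment("photo.jpg", "all vehicles")` sends the same fields as the curl example. The generous default timeout is a guess, on the assumption that inference on a large image can be slow.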

macOS App

Drag an image or video into the app, type a prompt (e.g. "all vehicles"), and click Segment. To let Gemma 4 decide what to segment, use Auto Analyze instead.

Build from Xcode:

cd GemmaSAMApp && xcodegen generate && open GemmaSAMApp.xcodeproj

Requirements

macOS 14+ (Apple Silicon)
Python 3.11+
Xcode 16+ (for SwiftUI app)
~19 GB RAM for model loading
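
The ~19 GB figure lines up with the sum of the two model sizes listed under Models (15.6 GB + 3.49 GB). A rough pre-flight check, assuming those sizes approximate the resident memory needed and ignoring the OS and app overhead, might look like the sketch below. The function names are illustrative, and `os.sysconf` is used here on the assumption that `SC_PHYS_PAGES` is available on the target Python build.

```python
import os

# Model sizes as quoted in this README, in GB.
MODEL_SIZES_GB = {
    "mlx-community/gemma-4-26b-a4b-it-4bit": 15.6,
    "mlx-community/sam3.1-bf16": 3.49,
}


def required_gb(sizes=MODEL_SIZES_GB):
    """Sum of the listed model sizes; a lower bound on memory, not a measurement."""
    return sum(sizes.values())


def physical_ram_gb():
    """Total physical memory in GB (decimal), read via POSIX sysconf."""
    return os.sysconf("SC_PAGE_SIZE") * os.sysconf("SC_PHYS_PAGES") / 1e9


def has_enough_ram(total_gb, needed_gb):
    """True when the machine's total RAM covers the model weights."""
    return total_gb >= needed_gb
```

For example, `has_enough_ram(physical_ram_gb(), required_gb())` gives a quick yes/no before starting the backend. A machine that only just passes may still swap once other applications are open.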
