Blog

Notes from the team

Thesis pieces, release write-ups, and engineering notes, written by the people building Hakim.

Choosing an Arabic voice AI in 2026, a buyer's guide

How to evaluate Arabic TTS, STT, and voice cloning APIs without the marketing claims. Eleven criteria that actually matter, what to ignore, and how the major providers stack up across MSA and the regional dialect tree.

May 2, 2026 · By Hakim Team · 12 min read

Engineering, Product

Voice agents for Arabic customer service, a 2026 guide

Building Arabic-language voice agents for the Gulf, Egypt, and the Levant. STT dialect coverage, sub-120 ms TTS, interrupt handling, and the four mistakes that sink voice-agent rollouts in MENA.

April 25, 2026 · By Hakim Engineering · 11 min read

Product, Engineering

How AI dubbing works for Arabic, and why dialect matters

AI dubbing isn't just translation plus TTS. A breakdown of the seven-stage pipeline, where dialect choice changes everything, lip-sync constraints, and how studios actually integrate AI dubbing into existing post-production workflows.

April 22, 2026 · By Hakim Team · 10 min read

Thesis, Models

What is Khaleeji Arabic, and why does it matter for voice AI?

Khaleeji isn't one dialect, it's a family. A primer on Najdi, Hejazi, Emirati, Kuwaiti, Qatari, and Bahraini varieties; what voice AI actually needs to handle them; and what 'covered' looks like in production.

April 18, 2026 · By Hakim Research · 9 min read

Thesis, Product

Why Arabic-first voice AI

Most voice AI treats Arabic as an afterthought. Here's why we rebuilt the stack with MSA and fifteen dialects as the default, not the fallback.

April 10, 2026 · By Hakim Team · 6 min read

Engineering, Product

Streaming TTS for real-time voice, how it actually works

Behind the sub-120 ms TTFB number: chunked autoregressive decoding, WebSocket transport semantics, mid-stream cancel, and the architectural choices that separate a streaming TTS from a batch TTS labelled streaming.

April 2, 2026 · By Hakim Engineering · 11 min read

Models, Product

Hakim Fast v1.3, 113 ms end-to-end TTS

A deep dive on the latency budget: model surgery, custom inference, and the four-region topology that gets a synthesis call to your user in 113 milliseconds.

March 20, 2026 · By Hakim Research · 5 min read

Models, Thesis

Dialect coverage that actually ships

Fifteen Arabic dialects and six MENA language tiers, evaluated on regional corpora, not press releases. How we measure it, and how we plan to grow it.

February 28, 2026 · By Hakim Research · 7 min read

Thesis, Product

Arabic TTS pricing in 2026, how to think about it

What you're actually paying for: per-character vs. per-second, free-tier traps, voice-cloning amortisation, dialect surcharges, and the hidden costs that show up at 100M characters per month.

February 5, 2026 · By Hakim Team · 7 min read

Thesis, Product

Voice cloning under the EU AI Act, a 2026 compliance brief

The EU AI Act entered into force on 1 August 2024, and its first obligations began to apply on 2 February 2025. What changed for voice cloning specifically: transparency disclosures, deepfake labelling, the GPAI obligations, and how the EU regime intersects with the GCC's identity-misuse statutes.

January 20, 2026 · By Hakim Team · 9 min read

Engineering, Product

Why four Arabic regions, not one

Frankfurt, UAE, KSA, Qatar: four full deployments from day one. Why the hub-and-spoke model doesn't work for Arabic voice AI, and what we built instead.

December 5, 2025 · By Hakim Engineering · 8 min read

Thesis, Product

Out of stealth, into production

Eighteen months head-down, four founders still on the stack, a production TTS + STT model family, and four live regions. Today we come out of stealth; here's the arc.

November 15, 2025 · By Hakim Team · 5 min read

Engineering, Product

Building bilingual products, an Arabic-English voice strategy

Most MENA-targeted products are bilingual by default. How to design a voice layer that handles AR/EN code-switching gracefully: language detection, prosody at the boundary, voice catalogue ergonomics, and the LLM prompt patterns that don't break.

October 20, 2025 · By Hakim Engineering · 9 min read

Engineering, Product

Voice cloning without the consent horror story

Quality is a single axis. Voice cloning is a four-axis problem: quality, consent, traceability, revocation. Here's how we built each of them, and what we refuse to do.

September 4, 2025 · By Hakim Team · 7 min read

Engineering, Models

Research preview to Fast v1.0: what we rewrote

A retrospective on the nine weeks of decoder surgery, the Beirut-Cairo-Riyadh corpus collection, and the boring infra work that separated a notebook demo from a production API.

February 10, 2025 · By Hakim Research · 9 min read