Add Japanese and trilingual text normalization for numbers and symbols #18

yuyun2000 · 2025-05-16T03:19:37Z

Changes

Implemented a Japanese text normalization module to handle numbers, symbols, and special characters
Added trilingual (Chinese/Japanese/English) text normalization support
Created regex patterns for converting numbers and symbols into pronounceable text
Integrated the new normalization modules into the existing text processing pipeline
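
As a rough illustration of the regex-based approach listed above, here is a minimal Python sketch that rewrites Arabic numerals and a few symbols in Japanese text as readable kanji/kana. It is not the code from this PR: the names `int_to_kanji`, `normalize_ja`, and `SYMBOLS` are hypothetical, the symbol table is only a sample, and counters, decimals, dates, and phone numbers are ignored.

```python
import re

DIGITS = "零一二三四五六七八九"
SMALL_UNITS = ["", "十", "百", "千"]   # units inside a 4-digit group
BIG_UNITS = ["", "万", "億", "兆"]     # one unit per 4-digit group

# Hypothetical sample table; a real module would cover many more symbols.
SYMBOLS = {"%": "パーセント", "+": "プラス", "&": "アンド"}

NUMBER_RE = re.compile(r"[0-9]+")


def _group_to_kanji(n: int) -> str:
    """Convert 0 <= n < 10000, omitting 一 before 十/百/千."""
    out = []
    for pos in range(3, -1, -1):
        d = (n // 10 ** pos) % 10
        if d == 0:
            continue
        if d == 1 and pos > 0:
            out.append(SMALL_UNITS[pos])
        else:
            out.append(DIGITS[d] + SMALL_UNITS[pos])
    return "".join(out)


def int_to_kanji(n: int) -> str:
    """Convert a non-negative integer below 10**16 to kanji numerals."""
    if n == 0:
        return DIGITS[0]
    groups = []
    idx = 0
    while n > 0:
        n, chunk = divmod(n, 10000)
        if chunk:
            groups.append(_group_to_kanji(chunk) + BIG_UNITS[idx])
        idx += 1
    return "".join(reversed(groups))


def _read_number(digits: str) -> str:
    # Very long strings and strings with leading zeros are read digit by digit.
    if len(digits) > 16 or (len(digits) > 1 and digits[0] == "0"):
        return "".join(DIGITS[int(ch)] for ch in digits)
    return int_to_kanji(int(digits))


def normalize_ja(text: str) -> str:
    """Replace digit runs first, then the symbols in the sample table."""
    text = NUMBER_RE.sub(lambda m: _read_number(m.group()), text)
    for symbol, reading in SYMBOLS.items():
        text = text.replace(symbol, reading)
    return text


print(normalize_ja("2025年に50%増加"))
```

Numbers are replaced before symbols so that a reading such as パーセント lands after the already-converted number.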

Why

This enhancement improves pronunciation accuracy when synthesizing Japanese content and multilingual text containing numbers and symbols, ensuring more natural-sounding speech output across all supported languages.

Testing

Verified correct normalization of various Japanese numerical expressions
Tested with mixed language text containing numbers and special symbols
Confirmed proper pronunciation of normalized text through synthesized audio output
Compared results against expected pronunciations in each language
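
Checks like the ones above could be kept as a small table of inputs and expected readings per language. The sketch below is only an assumption about how such a check might look: `SYMBOL_READINGS`, `normalize_symbols`, and the test cases are hypothetical and are not taken from this PR.

```python
import re

# Hypothetical per-language symbol readings (sample entries only).
SYMBOL_READINGS = {
    "zh": {"+": "加", "=": "等于"},
    "ja": {"+": "プラス", "=": "イコール"},
    "en": {"+": " plus ", "=": " equals "},
}


def normalize_symbols(text: str, lang: str) -> str:
    """Replace each known symbol with its reading in the given language."""
    table = SYMBOL_READINGS[lang]
    pattern = re.compile("|".join(re.escape(s) for s in table))
    return pattern.sub(lambda m: table[m.group()], text)


# (language, raw input, expected normalized text)
CASES = [
    ("zh", "1+1=2", "1加1等于2"),
    ("ja", "1+1=2", "1プラス1イコール2"),
    ("en", "1+1=2", "1 plus 1 equals 2"),
]

for lang, raw, expected in CASES:
    result = normalize_symbols(raw, lang)
    assert result == expected, (lang, raw, result)
```

Keeping the cases in one table makes it easy to add a regression case whenever a reading turns out wrong in synthesized audio.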

yuyun2000 and others added 5 commits May 15, 2025 15:17

10e4bdf  Refactor SOLA component code
    Streamline and simplify code in the SOLA module for improved readability and maintenance

ebf908a  Merge branch 'dev' into opt/melotts

6a96f35  Merge pull request #1 from yuyun2000/opt/melotts
    Refactor SOLA component code

74c41a3  Add text normalization for Chinese, Japanese, and English
    Implement regex-based text normalization functionality to support trilingual (CJE) content processing

0619178  Merge pull request #2 from yuyun2000/opt/melotts
    Add text normalization for Chinese, Japanese, and English

Abandon-ht merged commit e479b19 into m5stack:dev May 16, 2025
