Masaya Kawamura

BitTTS: Highly Compact Text-to-Speech Using 1.58-bit Quantization and Weight Indexing

Sep 1, 2025 · Masaya Kawamura, Takuya Hasumi, Yuma Shirahata, Ryuichi Yamamoto
Type: Conference paper
Publication: In Proceedings of Interspeech
Last updated on Sep 1, 2025
Tags: Interspeech, tts, speech

© 2025 Masaya Kawamura. This work is licensed under CC BY-NC-ND 4.0.
