Masaya Kawamura
  • Experience
  • Publication
  • Award & Grants
  • Talks
  • Publications
    • Description-based Controllable Text-to-Speech with Cross-Lingual Voice Control
    • LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning
    • PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-to-Speech Using Natural Language Descriptions
    • Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform
    • 混合Differentiable Digital Signal Processingモデルによる合成パラメータ抽出のためのラウドネスの時間変動に基づくロス関数の設計
    • Differentiable Digital Signal Processing Mixture Model for Synthesis Parameter Extraction from Mixture of Harmonic Sounds
    • 混合Differentiable DSPモデルによる混合楽器音からの合成パラメータ抽出の実験的評価
    • 楽譜情報を援用した音楽音響信号に対する混合Differentiable DSPモデルの合成パラメータ推定
    • Contrastive Response Pairs for Automatic Evaluation of Non-task-oriented Neural Conversational Models
    • ニューラル対話モデルの自動評価に向けた対照応答対評価セットの試作
  • Experience
  • Talks
    • ICASSP2023音声・音響読み会
    • ICASSP2022音声読み会
  • Award & Grants
    • IEEE Signal Processing Society Japan Student Conference Paper Award
    • Google Travel and Conference Grants

ICASSP2023音声・音響読み会

Jul 7, 2023 · 1 min read
cite
Date
Jul 7, 2023 6:00 PM — 8:00 PM
Event
ICASSP2023音声・音響読み会

タイトル:Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform

Last updated on Jul 7, 2023
Masaya Kawamura
Authors
Masaya Kawamura

ICASSP2022音声読み会 Jun 10, 2022 →

© 2025 Masaya Kawamura. This work is licensed under CC BY NC ND 4.0

Published with Hugo Blox Builder — the free, open source website builder that empowers creators.