default search action
Masami Akamine
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
Journal Articles
- 2016
- [j11]Yamato Ohtani, Masatsune Tamura, Masahiro Morita, Masami Akamine:
Statistical Bandwidth Extension for Speech Synthesis Based on Gaussian Mixture Model with Sub-Band Basis Spectrum Model. IEICE Trans. Inf. Syst. 99-D(10): 2481-2489 (2016) - [j10]Thomas Drugman, Yannis Stylianou, Yusuke Kida, Masami Akamine:
Voice Activity Detection: Merging Source and Filter-based Information. IEEE Signal Process. Lett. 23(2): 252-256 (2016) - [j9]Tudor-Catalin Zorila, Yannis Stylianou, Tatsuma Ishihara, Masami Akamine:
Near and Far Field Speech-in-Noise Intelligibility Improvements Based on a Time-Frequency Energy Reallocation Approach. IEEE ACM Trans. Audio Speech Lang. Process. 24(10): 1808-1818 (2016) - 2014
- [j8]Ranniery Maia, Masami Akamine:
On the impact of excitation and spectral parameters for expressive statistical parametric speech synthesis. Comput. Speech Lang. 28(5): 1209-1232 (2014) - [j7]Vincent Wan, Javier Latorre, Kayoko Yanagisawa, Norbert Braunschweiler, Langzhou Chen, Mark J. F. Gales, Masami Akamine:
Building HMM-TTS Voices on Diverse Data. IEEE J. Sel. Top. Signal Process. 8(2): 296-306 (2014) - [j6]Langzhou Chen, Mark J. F. Gales, Norbert Braunschweiler, Masami Akamine, Kate M. Knill:
Integrated Expression Prediction and Speech Synthesis From Text. IEEE J. Sel. Top. Signal Process. 8(2): 323-335 (2014) - 2013
- [j5]Ranniery Maia, Masami Akamine, Mark J. F. Gales:
Complex cepstrum for statistical parametric speech synthesis. Speech Commun. 55(5): 606-618 (2013) - 2012
- [j4]Masami Akamine, Jitendra Ajmera:
Decision tree-based acoustic models for speech recognition. EURASIP J. Audio Speech Music. Process. 2012: 10 (2012) - 2011
- [j3]Masami Akamine, Jitendra Ajmera:
Decision Tree-Based Acoustic Models for Speech Recognition with Improved Smoothness. IEICE Trans. Inf. Syst. 94-D(11): 2250-2258 (2011) - 2007
- [j2]Takehiko Kagoshima, Masahiro Morita, Shigenobu Seto, Masami Akamine, Yoshinori Shiga:
An F0 contour control model using an F0 contour codebook. Syst. Comput. Jpn. 38(1): 62-72 (2007) - 1999
- [j1]Takehiko Kagoshima, Masami Akamine:
Automatic generation of synthesis units by unit selection based on closed-loop training. Syst. Comput. Jpn. 30(9): 1-7 (1999)
Conference and Workshop Papers
- 2019
- [c39]Kenji Iwata, Takami Yoshida, Hiroshi Fujimura, Masami Akamine:
Transfer Learning for Unseen Slots in End-to-End Dialogue State Tracking. IWSDS 2019: 53-65 - 2018
- [c38]Takami Yoshida, Kenji Iwata, Hiroshi Fujimura, Masami Akamine:
Dialog State Tracking for Unseen Values Using an Extended Attention Mechanism. IWSDS 2018: 77-89 - [c37]Yuka Kobayashi, Takami Yoshida, Kenji Iwata, Hiroshi Fujimura, Masami Akamine:
Out-of-Domain Slot Value Detection for Spoken Dialogue Systems with Context Information. SLT 2018: 854-861 - 2015
- [c36]Yamato Ohtani, Yu Nasu, Masahiro Morita, Masami Akamine:
Emotional transplant in statistical speech synthesis based on emotion additive model. INTERSPEECH 2015: 274-278 - [c35]Ranniery Maia, Yannis Stylianou, Masami Akamine:
A maximum likelihood approach to the detection of moments of maximum excitation and its application to high-quality speech parameterization. INTERSPEECH 2015: 603-607 - 2014
- [c34]Yamato Ohtani, Masatsune Tamura, Masahiro Morita, Masami Akamine:
GMM-based bandwidth extension using sub-band basis spectrum model. INTERSPEECH 2014: 2489-2493 - 2013
- [c33]Javier Latorre, Mark J. F. Gales, Kate M. Knill, Masami Akamine:
Training a supra-segmental parametric F0 model without interpolating F0. ICASSP 2013: 6880-6884 - [c32]Ranniery Maia, Masami Akamine, Mark J. F. Gales:
Complex cepstrum analysis based on the minimum mean squared error. ICASSP 2013: 7972-7976 - [c31]Langzhou Chen, Mark J. F. Gales, Norbert Braunschweiler, Masami Akamine, Kate M. Knill:
Integrated automatic expression prediction and speech synthesis from text. ICASSP 2013: 7977-7981 - [c30]Ranniery Maia, Mark J. F. Gales, Yannis Stylianou, Masami Akamine:
Minimum mean squared error based warped complex cepstrum analysis for statistical parametric speech synthesis. INTERSPEECH 2013: 2336-2340 - [c29]Vincent Wan, Robert Anderson, Art Blokland, Norbert Braunschweiler, Langzhou Chen, BalaKrishna Kolluru, Javier Latorre, Ranniery Maia, Björn Stenger, Kayoko Yanagisawa, Yannis Stylianou, Masami Akamine, Mark J. F. Gales, Roberto Cipolla:
Photo-realistic expressive text to talking head synthesis. INTERSPEECH 2013: 2667-2669 - 2012
- [c28]Ranniery Maia, Masami Akamine, Mark J. F. Gales:
Complex cepstrum as phase information in statistical parametric speech synthesis. ICASSP 2012: 4581-4584 - [c27]Langzhou Chen, Mark J. F. Gales, Vincent Wan, Javier Latorre, Masami Akamine:
Exploring Rich Expressive Information from Audiobook Data Using Cluster Adaptive Training. INTERSPEECH 2012: 959-962 - [c26]Javier Latorre, Vincent Wan, Mark J. F. Gales, Langzhou Chen, K. K. Chin, Kate M. Knill, Masami Akamine:
Speech factorization for HMM-TTS based on cluster adaptive training. INTERSPEECH 2012: 971-974 - [c25]Vincent Wan, Javier Latorre, K. K. Chin, Langzhou Chen, Mark J. F. Gales, Heiga Zen, Kate M. Knill, Masami Akamine:
Combining multiple high quality corpora for improving HMM-TTS. INTERSPEECH 2012: 1135-1138 - [c24]Yamato Ohtani, Masatsune Tamura, Masahiro Morita, Takehiko Kagoshima, Masami Akamine:
Histogram-based spectral equalization for HMM-based speech synthesis using mel-LSP. INTERSPEECH 2012: 1155-1158 - [c23]Yamato Ohtani, Masatsune Tamura, Masahiro Morita, Takehiko Kagoshima, Masami Akamine:
HMM-based speech synthesis using sub-band basis spectrum model. INTERSPEECH 2012: 1440-1443 - 2011
- [c22]Javier Latorre, Mark J. F. Gales, Sabine Buchholz, Kate M. Knill, Masatsune Tamura, Yamato Ohtani, Masami Akamine:
Continuous F0 in the source-excitation generation for HMM-based TTS: Do we need voiced/unvoiced classification? ICASSP 2011: 4724-4727 - [c21]Masatsune Tamura, Masahiro Morita, Takehiko Kagoshima, Masami Akamine:
One sentence voice adaptation using GMM-based frequency-warping and shift with a sub-band basis spectrum model. ICASSP 2011: 5124-5127 - 2010
- [c20]Yusuke Shinohara, Takashi Masuko, Masami Akamine:
Covariance clustering on Riemannian manifolds for acoustic model compression. ICASSP 2010: 4326-4329 - [c19]Masatsune Tamura, Norbert Braunschweiler, Takehiko Kagoshima, Masami Akamine:
Unit selection speech synthesis using multiple speech units at non-adjacent segments for prosody and waveform generation. ICASSP 2010: 4802-4805 - [c18]Masatsune Tamura, Takehiko Kagoshima, Masami Akamine:
Sub-band basis spectrum model for pitch-synchronous log-spectrum and phase based on approximation of sparse coding. INTERSPEECH 2010: 2406-2409 - 2009
- [c17]Yusuke Shinohara, Masami Akamine:
Bayesian feature enhancement using a mixture of unscented transformation for uncertainty decoding of noisy speech. ICASSP 2009: 4569-4572 - [c16]Jitendra Ajmera, Masami Akamine:
Decision tree acoustic models for ASR. INTERSPEECH 2009: 1403-1406 - [c15]Javier Latorre, Sergio Gracia, Masami Akamine:
Feedback loop for prosody prediction in concatenative speech synthesis. INTERSPEECH 2009: 2067-2070 - 2008
- [c14]Yusuke Shinohara, Takashi Masuko, Masami Akamine:
Feature enhancement by speaker-normalized splice for robust speech recognition. ICASSP 2008: 4881-4884 - [c13]Hongfei Ding, Koichi Yamamoto, Masami Akamine:
Comparative evaluation of different methods for voice activity detection. INTERSPEECH 2008: 107-110 - [c12]Jitendra Ajmera, Masami Akamine:
Speech recognition using soft decision trees. INTERSPEECH 2008: 940-943 - [c11]Javier Latorre, Masami Akamine:
Multilevel parametric-base F0 model for speech synthesis. INTERSPEECH 2008: 2274-2277 - 2007
- [c10]Remco Teunen, Masami Akamine:
HMM-based speech recognition using decision trees instead of GMMs. INTERSPEECH 2007: 2097-2100 - 1999
- [c9]Tadashi Amada, Kimio Miseki, Masami Akamine:
CELP speech coding based on an adaptive pulse position codebook. ICASSP 1999: 13-16 - [c8]Chang K. Suh, Takehiko Kagoshima, Masahiro Morita, Shigenobu Seto, Masami Akamine:
Toshiba English text-to-speech synthesizer (TESS). EUROSPEECH 1999: 2111-2114 - 1998
- [c7]Masahiro Oshikiri, Masami Akamine:
A 2.4 kbps variable bit rate ADP-CELP speech coder. ICASSP 1998: 517-520 - [c6]Masami Akamine, Takehiko Kagoshima:
Analytic generation of synthesis units by closed loop training for totally speaker driven text to speech system (TOS drive TTS). ICSLP 1998 - [c5]Takehiko Kagoshima, Masahiro Morita, Shigenobu Seto, Masami Akamine:
An F0 contour control model for totally speaker driven text to speech system. ICSLP 1998 - [c4]Shigenobu Seto, Masahiro Morita, Takehiko Kagoshima, Masami Akamine:
Automatic rule generation for linguistic features analysis using inductive learning technique: linguistic features analysis in TOS drive TTS system. ICSLP 1998 - 1997
- [c3]Takehiko Kagoshima, Masami Akamine:
Automatic generation of speech synthesis units based on closed loop training. ICASSP 1997: 963-966 - 1991
- [c2]Kimio Miseki, Masami Akamine:
Adaptive bit-allocation between the pole-zero synthesis filter and excitation in CELP. ICASSP 1991: 229-232 - 1990
- [c1]Masami Akamine, Kimio Miseki:
CELP coding with an adaptive density pulse excitation model. ICASSP 1990: 29-32
Informal and Other Publications
- 2019
- [i1]Thomas Drugman, Yannis Stylianou, Yusuke Kida, Masami Akamine:
Voice Activity Detection: Merging Source and Filter-based Information. CoRR abs/1903.02844 (2019)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-09-20 00:42 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint