Jun 20, 2015

Installing Festival, an English text-to-speech system, on Windows 8.1

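Festival is a program that reads English text aloud. It is developed at the Centre for Speech Technology Research (CSTR) at the University of Edinburgh.

Among the English text-to-speech systems that anyone can use free of charge, I consider Festival one of the best.

My reasons: it sounds fairly natural, a script can switch speakers so that more natural dialogues can be produced, and several male and female voices are available.

I noticed that the latest version, 2.4, was released last December, so I installed it on Windows 8.1. This memo records the steps.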


How to use Festival on Cygwin64 on Windows 8.1

1. Introduction

 Festival is a general multi-lingual speech synthesis system developed at CSTR.
 http://www.cstr.ed.ac.uk/projects/festival/
 Version 2.4 was released in December 2014.
 I'm going to use Festival on my Windows 8.1 machine.

2. Install Cygwin64

 I installed Cygwin64.
 With the Cygwin64 installer (setup-x86_64) I also installed packages such as g++ (the GNU C++ compiler) and diff.
 To be safe, I installed almost all of the audio packages as well (possibly PulseAudio).
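
Before building, it may help to confirm from the Cygwin64 shell that the build tools are on the PATH. This is a minimal sketch: `missing_tools` is a hypothetical helper name, and the tool list (gcc, g++, make, diff) is an assumption based on the commands used in this memo.

```shell
# Print the name of every listed command that is not found on PATH.
missing_tools() {
    for t in "$@"; do
        command -v "$t" >/dev/null 2>&1 || echo "$t"
    done
}

# Assumed tool list for this build; no output means nothing is missing.
missing_tools gcc g++ make diff
```

Any name that is printed can be added by running setup-x86_64 again and selecting the matching package.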

3. Download and extract Festival sources.

  Download all of the *.tar.gz files from http://festvox.org/packed/festival/2.4/, including the voices.

  After extracting them, I have two directories, festival/ and speech_tools/.
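
The extraction step can be sketched as a small loop. `extract_all` is a hypothetical helper name. It unpacks every *.tar.gz found in the given directory into the current directory, and the archives themselves decide which directories are created.

```shell
# Unpack every *.tar.gz found in directory $1 (default: current directory)
# into the current working directory.
extract_all() {
    dir="${1:-.}"
    for f in "$dir"/*.tar.gz; do
        [ -e "$f" ] || continue    # no archives: the glob stays literal, so skip it
        echo "extracting $f"
        tar -xzf "$f" || return 1  # stop at the first archive that fails
    done
}
```

Running `extract_all .` in the download directory should leave festival/ and speech_tools/ side by side.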

4. make speech_tools

  #> cd  speech_tools

  #> vi config/systems/x86_64_unknown.mak
  ...(see below)...

  #> cat config/systems/x86_64_unknown.mak
  ## Settings for Cygwin64.
  include $(EST)/config/systems/default.mak
  DEFAULT_JAVA_HOME=/usr/lib/jdk-1.1.6
  gcc=gcc
  ## the native audio module for this type of system
  NATIVE_AUDIO_MODULE = WIN32
  GNUTEST=test
  RM=rm
  ## awk is gawk, so it does all we could desire and then more.
  NAWK=awk
  ## EGCS installs as gcc
  EGCS_CC=gcc
  EGCS_CXX=gcc
  OS_LIBS = -lwinmm -luser32

  #> ./configure
  #> make
  #> make test

5. make festival

  #> cd ../festival
  #> ./configure
  #> make
  #> make test

6. try festival

  #> bin/festival

  Festival Speech Synthesis System 2.4:release December 2014
  Copyright (C) University of Edinburgh, 1996-2010. All rights reserved.

  clunits: Copyright (C) University of Edinburgh and CMU 1997-2010
  clustergen_engine: Copyright (C) Carnegie Mellon University 2005-2014
  hts_engine:
  The HMM-Based Speech Synthesis Engine "hts_engine API"
  hts_engine API version 1.07 (http://hts-engine.sourceforge.net/)
  Copyright (C) The HMM-Based Speech Synthesis Engine "hts_engine API"
  Version 1.07 (http://hts-engine.sourceforge.net/)
  Copyright (C) 2001-2012 Nagoya Institute of Technology
                2001-2008 Tokyo Institute of Technology
  All rights reserved.

  All rights reserved.
  For details type `(festival_warranty)'
  festival> (SayText "hello")
  festival> (define utt1 (Utterance Text "hello world"))
  #<Utterance 0x6ffed1586f0>
  festival> (utt.synth utt1)
  #<Utterance 0x6ffed1586f0>
  festival> (utt.save.wave utt1 "name.wav" 'riff)
  #<Utterance 0x6ffed1586f0>
  festival> (ctrl-D)

  #> ls name.wav
  name.wav
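
Since the file was saved with the 'riff type, its header can be checked from the shell. A RIFF/WAVE file starts with the bytes "RIFF", a 4-byte size field, and then "WAVE". `is_riff_wave` is a hypothetical helper name, and this is only a rough sanity check, not a full validation of the file.

```shell
# Succeed when the file starts with "RIFF" and has "WAVE" at byte offset 8.
is_riff_wave() {
    [ "$(dd if="$1" bs=1 count=4 2>/dev/null)" = "RIFF" ] &&
    [ "$(dd if="$1" bs=1 skip=8 count=4 2>/dev/null)" = "WAVE" ]
}

# Check the file saved in the session above, if it is present.
if [ -f name.wav ]; then
    if is_riff_wave name.wav; then
        echo "name.wav looks like a RIFF/WAVE file"
    else
        echo "name.wav is not a RIFF/WAVE file"
    fi
fi
```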

  #> cp name.wav /dev/dsp
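
The session above uses the default voice. Because speakers can be switched from a script, a short dialogue can be sketched as a script file and run in batch mode with `festival -b`. This sketch assumes the voices that provide `voice_kal_diphone` and `voice_rab_diphone` are installed; substitute the `voice_*` functions for whichever voices you downloaded. `write_dialogue` and the file name dialogue.scm are hypothetical.

```shell
# Write a two-speaker dialogue as a Festival script into file $1.
# ASSUMPTION: the voice_kal_diphone and voice_rab_diphone voices are installed.
write_dialogue() {
    cat > "$1" <<'EOF'
(voice_kal_diphone)
(SayText "Hello. How are you?")
(voice_rab_diphone)
(SayText "Fine, thank you.")
EOF
}

write_dialogue dialogue.scm

# Run the script in batch mode only when the freshly built binary is present.
if [ -x bin/festival ]; then
    bin/festival -b dialogue.scm
fi
```

Each `(voice_...)` call selects the speaker for the `SayText` lines that follow it, so alternating the calls alternates the speakers.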