Abstract
An important topic in unit selection based speech synthesis is the scalability of such systems. Related to this problem is the question regarding the optimal size of a unit selection database. An ideal system should produce ever better synthesis results when more data is added to the system, but for a practical system this might not be the case. The unit selection criteria are generally not sufficiently developed to ensure that a system makes an optimal use of the data that it has available.
In this paper we propose a database reduction technique based on the statistical behaviour of unit selection. We investigate the effect of scaling down the database by objective and subjective criteria. We compare the proposed reduction technique with a technique that simply limits the size of unit lists to a fraction of their original size (random removal).
The results show that the proposed technique is far better than random removal, and that we can remove a significant portion of our database without causing any severe quality loss.
In this paper we propose a database reduction technique based on the statistical behaviour of unit selection. We investigate the effect of scaling down the database by objective and subjective criteria. We compare the proposed reduction technique with a technique that simply limits the size of unit lists to a fraction of their original size (random removal).
The results show that the proposed technique is far better than random removal, and that we can remove a significant portion of our database without causing any severe quality loss.
Original language | English |
---|---|
Title of host publication | 7th International Conference on Spoken Language Processing (ICSLP2002), Proceedings of the |
Subtitle of host publication | Interspeech 2002 |
Editors | John H. L. Hansen, Bryan Pellom |
Publisher | ISCA |
Number of pages | 4 |
Publication status | Published - 2002 |
Event | 7th International Conference on Spoken Language Processing (Interspeech 2002) - Denver, CO, United States Duration: 16 Sep 2002 → 20 Sep 2002 |
Conference
Conference | 7th International Conference on Spoken Language Processing (Interspeech 2002) |
---|---|
Country/Territory | United States |
City | Denver, CO |
Period | 16/09/02 → 20/09/02 |