Projects per year
A broadcast news stream consists of a number of stories and it is an important task to find the boundaries of stories automatically in news analysis. We capture the topic structure using a hierarchical model based on a Recurrent Neural Network (RNN) sentence modeling layer and a bidirectional Long Short-Term Memory (LSTM) topic modeling layer, with a fusion of acoustic and lexical features. Both features are accumulated with RNNs and trained jointly within the model to be fused at the sentence level. We conduct experiments on the topic detection and tracking (TDT4) task comparing combinations of two modalities trained with limited amount of parallel data. Further we utilize additional sufficient text data for training to polish our model. Experimental results indicate that the hierarchical RNN topic modeling takes advantage of the fusion scheme, especially with additional text training data, with a higher F1-measure compared to conventional state-of-the-art methods.
|Title of host publication||2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)|
|Publisher||Institute of Electrical and Electronics Engineers (IEEE)|
|Number of pages||9|
|Publication status||Published - 25 Jan 2018|
|Event||2017 IEEE Automatic Speech Recognition and Understanding Workshop - Okinawa, Japan|
Duration: 16 Dec 2017 → 20 Dec 2017
|Conference||2017 IEEE Automatic Speech Recognition and Understanding Workshop|
|Abbreviated title||ASRU 2017|
|Period||16/12/17 → 20/12/17|
FingerprintDive into the research topics of 'Hierarchical recurrent neural network for story segmentation using fusion of lexical and acoustic features'. Together they form a unique fingerprint.
- 1 Finished
SUMMA - Scalable Understanding of Mulitingual Media
Renals, S., Birch-Mayne, A. & Cohen, S.
1/02/16 → 31/01/19