Abstract
Automatically understanding the plot of novels is important both for informing literary scholarship and applications such as summarization or recommendation. Various models have addressed this task, but their evaluation has remained largely intrinsic and qualitative. Here, we propose a principled and scalable framework leveraging expert-provided semantic tags (e.g., mystery, pirates) to evaluate plot representations in an extrinsic fashion, assessing their ability to produce locally coherent groupings of novels (micro-clusters) in model space. We present a deep recurrent autoencoder model that learns richly structured multi-view plot representations, and show that they i) yield better microclusters than less structured representations; and ii) are interpretable, and thus useful for further literary analysis or labelling of the emerging micro-clusters.
Original language | English |
---|---|
Title of host publication | Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP 2017) |
Publisher | Association for Computational Linguistics |
Pages | 1873–1883 |
Number of pages | 11 |
DOIs | |
Publication status | Published - 7 Sep 2017 |
Event | EMNLP 2017: Conference on Empirical Methods in Natural Language Processing - Copenhagen, Denmark Duration: 7 Sep 2017 → 11 Sep 2017 http://emnlp2017.net/index.html http://emnlp2017.net/ |
Conference
Conference | EMNLP 2017: Conference on Empirical Methods in Natural Language Processing |
---|---|
Abbreviated title | EMNLP 2017 |
Country/Territory | Denmark |
City | Copenhagen |
Period | 7/09/17 → 11/09/17 |
Internet address |