The SUMMA Platform: A Scalable Infrastructure for Multi-lingual Multi-media Monitoring

Ulrich Germann, Renars Liepins, Guntis Barzdins, Didzis Gosko, Sebastiao Miranda, David Nogueira

Research output: Chapter in Book/Report/Conference proceeding (Conference contribution)

Abstract

The open-source SUMMA Platform is a highly scalable distributed architecture for monitoring a large number of media broadcasts in parallel, with a lag behind actual broadcast time of at most a few minutes.

The Platform offers a fully automated media ingestion pipeline that records live broadcasts, detects and transcribes spoken content, translates all text (original or transcribed) into English, recognizes and links Named Entities, detects topics, clusters related media items and produces cross-lingual multi-document summaries of them, and extracts and stores the factual claims made in these news items. Browser-based graphical user interfaces give users aggregated information as well as structured access to the individual news items stored in the Platform’s database.

This paper describes the intended use cases and provides an overview of the system’s implementation.
Original language: English
Title of host publication: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics
Subtitle of host publication: Demo Session
Place of Publication: Melbourne, Australia
Publisher: Association for Computational Linguistics
Pages: 99-104
Number of pages: 6
Publication status: Published - Jul 2018
Event: 56th Annual Meeting of the Association for Computational Linguistics - Melbourne Convention and Exhibition Centre, Melbourne, Australia
Duration: 15 Jul 2018 to 20 Jul 2018
http://acl2018.org/

Conference

Conference: 56th Annual Meeting of the Association for Computational Linguistics
Abbreviated title: ACL 2018
Country/Territory: Australia
City: Melbourne
Period: 15/07/18 to 20/07/18