Web Scraping of Online Newspapers via Image Matching

D. Moltisanti, G. M. Farinella, S. Battiato, G. Giuffrida

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract / Description of output

Reading is an activity which takes place widely on the web: almost all newspapers have his own digital version on the internet and there are even a lot of magazines only on the web. In such a scenario, Computer Vision can offer a useful set of tools that can help web editors to improve the quality of the provided service. One of these tools is here presented: given a webpage of a newspaper or journal, the proposed framework localizes news items remotely clicked by users, giving the bounding box of the content of an article in its relative homepage. The tool is hence able to track an article in the page in which is contained at any time during the day: such an information is very useful for web editors to understand the trend of the published items and to rearrange the contents of the homepage accordingly.
Original languageEnglish
Title of host publicationProgress in Industrial Mathematics at ECMI 2014
EditorsGiovanni Russo, Vincenzo Capasso, Giuseppe Nicosia, Vittorio Romano
Place of PublicationCham
PublisherSpringer
Pages17-24
Number of pages8
ISBN (Electronic)978-3-319-23413-7
ISBN (Print)978-3-319-79479-2
DOIs
Publication statusPublished - 5 Sept 2017
EventThe 18th European Conference on Mathematics for Industry 2014 - Taormina, Italy
Duration: 9 Jun 201413 Jun 2014
Conference number: 18
https://ecmi2014.taosciences.org/

Publication series

NameMathematics in Industry
PublisherSpringer, Cham
Volume22
ISSN (Print)1612-3956
ISSN (Electronic)2198-3283

Conference

ConferenceThe 18th European Conference on Mathematics for Industry 2014
Abbreviated titleECMI 2014
Country/TerritoryItaly
CityTaormina
Period9/06/1413/06/14
Internet address

Keywords / Materials (for Non-textual outputs)

  • Computer vision
  • Image matching

Fingerprint

Dive into the research topics of 'Web Scraping of Online Newspapers via Image Matching'. Together they form a unique fingerprint.

Cite this