Analysis and Forecasting of Web Content Dynamics

Maria Carla Calzarossa, Daniele Tessera

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

Web content changes have a strong impact on search engines and more generally on technologies dealing with content retrieval and management. These technologies have to take account of the temporal patterns of these changes and adjust their crawling policies accordingly. This paper presents a methodological framework – based on time series analysis – for modeling and predicting the dynamics of the content changes. To test this framework, we analyze the content of three major news websites whose change patterns are characterized by large fluctuations and significant differences across days and hours. The classical decomposition of the observed time series into trend, seasonal and irregular components is applied to identify the weekly and daily patterns as well as the remaining fluctuations. The corresponding models are used for predicting the future dynamics of the sites based on their current and historical behavior.
Original languageEnglish
Title of host publication2018 32nd International Conference on Advanced Information Networking and Applications Workshops
Pages12-17
Number of pages6
DOIs
Publication statusPublished - 2018
Event2018 32nd International Conference on Advanced Information Networking and Applications Workshops - Kraków
Duration: 16 May 201818 May 2018

Conference

Conference2018 32nd International Conference on Advanced Information Networking and Applications Workshops
CityKraków
Period16/5/1818/5/18

Keywords

  • Forecasting
  • web content dynamics

Fingerprint

Dive into the research topics of 'Analysis and Forecasting of Web Content Dynamics'. Together they form a unique fingerprint.

Cite this