GB-R: A Fast and Effective Gray-Box Reconstruction of Cascade Time-Series

Details

Journal: IEEE Conference
Date: April 15, 2018
DOI: 10.1109/ICDMW.2017.70
Category: Scientific Research

Description

Researchers led by Hyun Ah Song of the Machine Learning Department at Carnegie Mellon University developed an algorithm that reconstructs time-series counts from aggregated reports by carefully infusing domain knowledge. Its reconstructions were compared with Project Tycho data.

Authors

Hyun Ah Song
Fan Yang
Zongge Liu
Wilbert van Panhuis
Nicholas Sidiropoulos
Christos Faloutsos
Vladimir Zadorozhny

Related Project Tycho Datasets

Abstract

Given some (but not all) monthly totals of people with measles (or counts of product-units sold, or counts of retweets), how can we recover the weekly counts? Requiring smoothness between successive weeks is reasonable, but can we do better if we have some domain knowledge? For example, we know that measles (flu, count-of-retweets, etc.) follows a specific cascade model, like the so-called 'SIS'. The answer is 'yes'. With our proposed GB-R we show how to inject domain knowledge, creating a gray-box model; we show how to set up and efficiently solve the appropriate optimization problem. The desirable properties of our GB-R are: (a) effectiveness, outperforming the best competitors on real epidemiology data, often by 3x to 25x in reconstruction error; (b) scalability, being linear in the sequence length; and (c) interpretability, accurately estimating the parameters of the gray-box model.
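
To make the problem setup concrete, the sketch below implements only the smoothness baseline that the abstract mentions, not GB-R itself. It recovers weekly counts from a partial set of aggregated totals by minimizing the report mismatch plus a penalty on week-to-week differences. The function name `reconstruct_smooth`, the `(start, end, total)` report layout, and the weight `lam` are assumptions made for illustration and do not come from the paper.

```python
import numpy as np

def reconstruct_smooth(n_weeks, reports, lam=1.0):
    """Smoothness-only reconstruction (illustrative baseline, not GB-R).

    reports: list of (start, end, total) tuples; each total is the sum of
             the weekly counts over weeks start..end-1 (end is exclusive).
    lam:     hypothetical weight on the smoothness penalty.
    Solves   min_x ||A x - y||^2 + lam * ||D x||^2   by least squares.
    """
    # Aggregation matrix A: one row per report, 1.0 for every covered week.
    A = np.zeros((len(reports), n_weeks))
    y = np.zeros(len(reports))
    for i, (start, end, total) in enumerate(reports):
        A[i, start:end] = 1.0
        y[i] = total

    # First-difference operator D, so (D x)[t] = x[t+1] - x[t].
    D = np.diff(np.eye(n_weeks), axis=0)

    # Stack both terms into a single ordinary least-squares system.
    M = np.vstack([A, np.sqrt(lam) * D])
    b = np.concatenate([y, np.zeros(n_weeks - 1)])
    x, *_ = np.linalg.lstsq(M, b, rcond=None)
    return x

# Twelve weeks, with the middle four-week report missing.
weekly = reconstruct_smooth(12, [(0, 4, 40.0), (8, 12, 40.0)])
print(np.round(weekly, 2))
```

With only a smoothness prior, the gap left by the missing report is filled by the flattest curve consistent with the observed totals. According to the abstract, GB-R improves on this by additionally requiring the reconstruction to follow a cascade model such as SIS.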
