Advanced Search

Browse by Discipline

Scientific Societies

E-print Alerts

Add E-prints

E-print Network

  Advanced Search  

Lehrstuhl fr Informatik 9 Zeit: Donnerstag, 7.4.2011 , 16:30 Uhr

Summary: Lehrstuhl für Informatik 9
Zeit: Donnerstag, 7.4.2011 , 16:30 Uhr
Ort: AH I, Ahornstr. 55
Referent: Prof. Themis Palpanas
DISI ­ University of Trento, Italy
Titel: iSAX 2.0: Indexing and Mining One Billion Time Series
There is an increasingly pressing need, by several applications in diverse domains,
for developing techniques able to index and mine very large collections of time se-
ries. Examples of such applications come from astronomy, biology, the web, and oth-
er domains. It is not unusual for these applications to involve numbers of time series
in the order of Hundreds of millions to billions. In this paper, we describe iSAX 2.0, a
data structure designed for indexing and mining truly massive collections of time se-
ries. We show that the main bottleneck in mining such massive datasets is the time
taken to build the index, and we thus introduce a novel bulk loading mechanism, the
first of this kind specifically tailored to a time series index. We show how our method
allows mining on datasets that would otherwise be completely untenable, including
the first published experiments to index one billion time series, and experiments in
mining massive data from domains as diverse as entomology, DNA and web-scale


Source: Ábrahám, Erika - Fachgruppe Informatik, Rheinisch Westfälische Technische Hochschule Aachen (RWTH)


Collections: Computer Technologies and Information Sciences