Challenges and Opportunities in Internet Data Mining David G. Andersen

Summary: Challenges and Opportunities in Internet Data Mining
David G. Andersen
Carnegie Mellon University
Nick Feamster
Georgia Institute of Technology
Jan 2006
Parallel Data Laboratory
Carnegie Mellon University
Pittsburgh, PA 15213-3890
Internet measurement data provides the foundation for the operation and planning of the networks that comprise the Internet, and
is a necessary component in research for analysis, simulation, and emulation. Despite its critical role, however, the management of
this data--from collection and transmission to storage and its use within applications--remains primarily ad hoc, relying on techniques
created and re-created by each corporation or researcher that uses the data. This paper examines several of the challenges faced
when attempting to collect and archive large volumes of network measurement data, and outlines an architecture for an Internet
data repository--the datapository--designed to create a framework for collaboratively addressing these challenges.
Keywords: network monitoring, databases, data management, data mining
1 Introduction
Communications networks produce vast amounts of data that both help network operators manage and plan


Source: Andersen, Dave - School of Computer Science, Carnegie Mellon University
Carnegie Mellon University, Department of Electrical and Computer Engineering, Parallel Data Lab.


Collections: Computer Technologies and Information Sciences