In Proceedings of the 4th Symposium on Networked Systems Design
and Implementation (NSDI '07), Cambridge, Massachusetts, April 2007.
Exploiting Similarity for Multi-Source Downloads
Using File Handprints
Himabindu Pucha, David G. Andersen, Michael Kaminsky
Purdue University, Carnegie Mellon University, Intel Research Pittsburgh
Abstract
Many contemporary approaches for speeding up large
file transfers attempt to download chunks of a data object
from multiple sources. Systems such as BitTorrent
quickly locate sources that have an exact copy of the desired
object, but they are unable to use sources that serve
similar but non-identical objects. Other systems automatically
exploit cross-file similarity by identifying sources
for each chunk of the object. These systems, however,
require a number of lookups proportional to the number
of chunks in the object and a mapping for each unique
chunk in every identical and similar object to its corresponding
sources. Thus, the lookups and mappings in
such a system can be quite large, limiting its scalability.
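
The scaling concern for per-chunk lookup can be illustrated with a minimal sketch. It is not the design of any system named above. It assumes fixed-size chunks, SHA-1 chunk identifiers, and an in-memory dictionary standing in for a global lookup service; the names `chunk_hashes`, `publish`, and `find_sources` are hypothetical.

```python
import hashlib
from typing import Dict, List, Set, Tuple


def chunk_hashes(data: bytes, chunk_size: int) -> List[str]:
    """Split data into fixed-size chunks (an assumption) and hash each one."""
    return [
        hashlib.sha1(data[i:i + chunk_size]).hexdigest()
        for i in range(0, len(data), chunk_size)
    ]


def publish(index: Dict[str, Set[str]], source: str,
            data: bytes, chunk_size: int) -> None:
    """Record `source` as a holder of every chunk of `data`.

    The index gains one entry per unique chunk across all published objects.
    """
    for h in chunk_hashes(data, chunk_size):
        index.setdefault(h, set()).add(source)


def find_sources(index: Dict[str, Set[str]],
                 hashes: List[str]) -> Tuple[List[Set[str]], int]:
    """Issue one lookup per chunk; return per-chunk sources and lookup count."""
    lookups = 0
    sources = []
    for h in hashes:
        lookups += 1  # each chunk costs one query to the lookup service
        sources.append(index.get(h, set()))
    return sources, lookups


CHUNK = 4  # tiny chunk size, chosen only to keep the example readable
index: Dict[str, Set[str]] = {}
publish(index, "peer1", b"AAAABBBBCCCCDDDD", CHUNK)  # the desired object
publish(index, "peer2", b"AAAABBBBXXXXDDDD", CHUNK)  # a similar object

wanted = chunk_hashes(b"AAAABBBBCCCCDDDD", CHUNK)
sources, lookups = find_sources(index, wanted)
print(lookups, len(index))  # prints: 4 5
```

In this toy setting, the receiver finds that peer2 can supply three of the four chunks even though peer2 holds a different object, but it pays one lookup per chunk, and the index holds one mapping per unique chunk across both objects.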