Knowledge Discovery Laboratory
Dataset: Can-o-sleep

Data characteristics:
  • over 500,000 objects
  • over 6 million links
  • 14 object attributes
  • 6 link attributes
Additional information:
The can-o-sleep database is named for the file-sharing application used. Its data were collected from a campus peer-to-peer (P2P) file-sharing network based on the OpenNap server. The data consist of records of all the mp3 files shared by and transferred between users during an 81-day period between February 28, 2003 and May 21, 2003. Each user was uniquely identified by an anonymous MD5 hash. No personal information was collected during this study, and users gave explicit consent to anonymous collection of the data.
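
To illustrate what identifying users by an MD5 hash can look like, here is a minimal Python sketch. It is not the procedure used to build this database: the page does not say which input was hashed or whether a salt was applied, so the `anonymize_user` helper, its `salt` parameter, and the sample transfer records are all hypothetical.

```python
import hashlib


def anonymize_user(identifier: str, salt: str = "") -> str:
    """Hypothetical helper: map a user identifier to a 32-character hex MD5 digest."""
    return hashlib.md5((salt + identifier).encode("utf-8")).hexdigest()


# Hypothetical transfer records: (sender, receiver, file name).
transfers = [
    ("alice", "bob", "song_a.mp3"),
    ("bob", "carol", "song_b.mp3"),
    ("alice", "carol", "song_a.mp3"),
]

# Replace both user fields with their digests; the same user always maps to
# the same digest, so links between users are preserved after anonymization.
anonymized = [
    (anonymize_user(sender), anonymize_user(receiver), name)
    for sender, receiver, name in transfers
]

for sender, receiver, name in anonymized:
    print(sender, receiver, name)
```

Because the mapping is deterministic, records belonging to the same user can still be joined after the original identifiers are discarded, which is what lets the data be modeled as objects connected by links.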

See Creating Social Networks to Improve Peer-to-Peer Networking for a description of using knowledge discovery techniques on this data to guide the creation of an efficient overlay network for peer-to-peer file sharing.

See the README for additional information on the can-o-sleep database.

Acknowledgments:
Please include the following acknowledgment in all publications that describe work using this database:

The PROXIMITY can-o-sleep database is based on data collected by the Privacy, Internetworking, Security, and Mobile Systems Laboratory at the University of Massachusetts Amherst with additional preparation by the Knowledge Discovery Laboratory, University of Massachusetts Amherst.
Preparation of the PROXIMITY can-o-sleep database was supported by Lawrence Livermore National Laboratory and the Department of Energy under contract number W7405-ENG-48.