Where can I find anonymized access logs (clickstream data)?

asked 09 Feb '11, 17:59

melipone

edited 09 Feb '11, 18:02

rgrp ♦♦

This is slightly off-topic, but it may still be relevant: the ICWSM 2011 Data Challenge has a dataset from Spinn3r.com "consist[ing] of over 386 million blog posts, news articles, classifieds, forum posts and social media content between January 13th and February 14th. The content includes the syndicated text, its original HTML as found on the web, annotations and metadata (e.g., author information, time of publication and source URL), and boilerplate/chrome extracted content ... [and contains] over 133 million blog posts and 231 million social media publication[s], [at] a size of ~3 TB (2.1 TB compressed)."


answered 23 Feb '11, 20:46

psychemedia ♦♦
