Where can I find anonymized access logs (clickstream data)?

asked 09 Feb '11, 17:59

melipone's gravatar image

melipone
1223
accept rate: 0%

edited 09 Feb '11, 18:02

rgrp's gravatar image

rgrp ♦♦
501122027


Slightly off-topic perhaps, but still maybe relevant?, the ICWSM 2011 Data Challenge has a dataset from Spinn3r.com "consist[ing] of over 386 million blog posts, news articles, classifieds, forum posts and social media content between January 13th and February 14th. The content includes the syndicated text, its original HTML as found on the web, annotations and metadata (e.g., author information, time of publication and source URL), and boilerplate/chrome extracted content ... [and contains] over 133 million blog posts and 231 million social media publication[s], [at] a size of ~3 TB (2.1 TB compressed)."

link

answered 23 Feb '11, 20:46

psychemedia's gravatar image

psychemedia ♦♦
1.1k323961
accept rate: 11%

Your answer
toggle preview

Follow this question

By Email:

Once you sign in you will be able to subscribe for any updates here

By RSS:

Answers

Answers and Comments

Markdown Basics

  • *italic* or _italic_
  • **bold** or __bold__
  • link:[text](http://url.com/ "title")
  • image?![alt text](/path/img.jpg "title")
  • numbered list: 1. Foo 2. Bar
  • to add a line break simply add two spaces to where you would like the new line to be.
  • basic HTML tags are also supported

Tags:

×7
×5
×1
×1

Asked: 09 Feb '11, 17:59

Seen: 1,411 times

Last updated: 23 Feb '11, 20:46

powered by OSQA