PEEC
Partitioning Enron Email Corpus


I'm email

Introduction

Email has become the primary mechanism for communication within many large corporations and is becoming the primary form of informal inter-organizational communication. Indeed recent research has noted that email is used to manage personal information - people send email to themselves, deliberately store data as mail attachments, and use mail folders and time stamps to index their documents (Whittaker et al, 2006). The Enron email collection is a large corpus of authentic email data and as such is a unique and invaluable research tool in the search for the next generation of organizational software. The goal of the PEEC project was to add annotations to a significant portion of the Enron data, there by making the data amenable to a wider range of experiments.

.


back homeHOME