
What is SOCRATES?

SOCRATES is a social-computational system and platform for studying social media using crowdsourcing. The project will develop a framework (see figure below) and a technical system through which researchers can collect content from one or more social media sources, explore the collected content to help generate hypotheses, and analyze the content to produce insights, findings, and research results at true social-computational scale.

Components of SOCRATES

Collect

The SOCRATES Collect component will provide seamless support for collecting vast amounts of data from social media sites easily and effectively. This interactive component will let researchers specify their needs quickly and intuitively (e.g., using keywords with source selection), retrieve data from these disparate sources, transform the data into standardized structured formats, and easily modify the initial setup and criteria for data collection. One important lesson from previous experience, as well as from preliminary investigations, is that such a component should integrate easily into existing systems and practices. The project will therefore address the challenge of collecting large amounts of social media data with minimal effort on the researchers’ end, whether in learning new environments or in maintaining a proper workflow with their existing systems.
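
As a rough illustration of the idea of keyword-plus-source selection feeding a standardized record format, the following is a minimal Python sketch. It is not the SOCRATES implementation: the names CollectionSpec, Record, NORMALIZERS, and collect, the source labels "microblog" and "video", and all raw field names are hypothetical and chosen only for this example. Raw items are assumed to have been fetched already, so no real social media API is called.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, List


@dataclass
class CollectionSpec:
    """Hypothetical description of what a researcher wants to collect."""
    keywords: List[str]
    sources: List[str]


@dataclass
class Record:
    """Hypothetical standardized format shared by all sources."""
    source: str
    item_id: str
    author: str
    text: str
    created_at: str


def normalize_microblog(raw: Dict[str, Any]) -> Record:
    # Assumed raw field names for an imaginary microblog source.
    return Record("microblog", str(raw["id"]), raw["user"], raw["body"], raw["ts"])


def normalize_video(raw: Dict[str, Any]) -> Record:
    # Assumed raw field names for an imaginary video source.
    text = f'{raw["title"]} {raw["description"]}'
    return Record("video", str(raw["video_id"]), raw["channel"], text, raw["published"])


# One normalizer per source; supporting a new source means adding one entry.
NORMALIZERS: Dict[str, Callable[[Dict[str, Any]], Record]] = {
    "microblog": normalize_microblog,
    "video": normalize_video,
}


def matches(record: Record, keywords: List[str]) -> bool:
    """Case-insensitive substring match on any keyword."""
    text = record.text.lower()
    return any(keyword.lower() in text for keyword in keywords)


def collect(spec: CollectionSpec,
            raw_by_source: Dict[str, List[Dict[str, Any]]]) -> List[Record]:
    """Normalize raw items from the selected sources and keep keyword matches."""
    records: List[Record] = []
    for source in spec.sources:
        normalize = NORMALIZERS[source]
        for raw in raw_by_source.get(source, []):
            record = normalize(raw)
            if matches(record, spec.keywords):
                records.append(record)
    return records
```

In this sketch, changing the collection criteria amounts to building a new CollectionSpec with different keywords or sources and calling collect again, which mirrors the goal of easy modification to the initial setup.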

Explore

The SOCRATES Explore component will offer visualizations of the data that enable targeted exploration, allowing researchers to gain understanding of and insight into the data independently. The project will develop an interactive application that allows researchers to examine the collected material along multiple dimensions, exploring high-level aggregate trends alongside individual content items (“overview first, details on demand” is a related idea in the visualization community). Researchers will be able to share the interactive visualization and the collected data with others, to help “crowdsource” the exploration and hypothesis generation process. Users will be able to explore, comment, and provide insights in a way that enriches the data, and to offer the researchers new hypotheses about it.

Analyze

To analyze social media data at scale, SOCRATES will support efficient, accurate, and valid annotation of the collected content using a specialized crowdsourcing environment. Such annotation serves multiple purposes, including but not limited to categorization that supports evidentiary inferences or hypothesis testing by social scientists, and algorithm development by information scientists.