U-M leads $4 million project to preserve poll and survey data


ANN ARBOR, Mich. -- In the thick of a presidential election, the latest findings from surveys and polls are reported on a daily basis. But much of the data behind the news on American public opinion is literally here today and gone tomorrow.

"At least half the survey and poll data collected since the 1940s has disappeared," said historian Myron Gutmann, director of the Inter-university Consortium for Political and Social Research (ICPSR) at the University of Michigan Institute for Social Research. "We're not sure yet if it's gone permanently or not."

Gutmann is the principal investigator on a new $4.1 million project to acquire and preserve data from opinion polls, voting records, large-scale surveys and other social science studies. Funded primarily by the Library of Congress, the world's largest library, the three-year project is a broad-based partnership between ICPSR, the world's largest academic social science data archive, and five other institutions.

Other institutions involved in the project are the Roper Center for Public Opinion Research at the University of Connecticut, the Howard W. Odum Institute at the University of North Carolina-Chapel Hill, the Henry A. Murray Research Center at Harvard's Radcliffe Institute, the National Archives and Records Administration, and the Harvard-MIT Data Center.

"This effort will ensure that future generations of Americans have access to vital material that will allow them to understand their nation, its social organization and its policies and politics," Gutmann said.

For three-quarters of a century, public opinion polls, social surveys and other kinds of structured interviews have tracked people's values, attitudes, knowledge and behavior. Surveys have done more than predict the outcomes of elections or tell us when presidents gain or lose popularity. They inform us about aging, health and health care, race relations, women's rights, employment and family life: the full story of the social and cultural tapestry that makes up our nation. They provide the data necessary for sound, empirically based policy-making.

But a huge quantity of this data is missing or at risk. "It has not been archived and without aggressive activities to locate and preserve it, it will disappear for good," Gutmann said. "This at-risk data can be found on the computers of individual researchers and research institutions, in bookcases and libraries, even in boxes of punched cards stored in warehouses. Some data reside on websites that don't have truly persistent URLs."

The good news, Gutmann says, is that the missing material has left tracks in the form of news releases, public grant announcements and publications describing the research, and researchers affiliated with the new project will follow them. After identifying and finding at-risk content, the project aims to acquire the data, ensure its security and prepare public use files that safeguard confidentiality.

"Our goal is to assure that the material remains accessible, complete, uncorrupted and usable over time," Gutmann said. "Rapid technological change will always threaten the viability of digital materials produced in previous years under obsolete technological conditions. But this project will greatly enhance our ability to preserve important data collections."

Source: Eurekalert & others

Last reviewed by John M. Grohol, Psy.D., on 21 Feb 2009
Published on PsychCentral.com. All rights reserved.