FOSE

The mosaic effect and big data

open eye and data

The proliferation of government data sets is providing developers with ample fodder for writing useful and potentially profitable applications around census, weather, health, energy, business, agricultural and other information. But as the government makes more and more data discoverable and machine readable, there is the threat that disparate threads can be pieced together in a way that yields information that is supposed to be private.

This kind of analysis through the combination of big data sets is called the mosaic effect. And it isn't necessarily bad, Marion Royal, director of Data.gov at the General Services Administration, said at a May 13 FOSE session. He noted, for example, that the combination of big data sets can supply clues on the paths of seasonal flu outbreaks. But there is also the potential for a bad guy to, say, use transportation data and energy production data to figure out where oil and gas are moving on trains and trucks.

More from FOSE

News and notes: May 13

Cobert tees up IT management plans


Plus: GCN, FCW's sister publication covering technology, tools and tactics for public sector IT, is covering FOSE in even greater detail. Get all the GCN coverage here.

The White House publicly released its Open Data Action Plan on May 9, the one-year anniversary of President Barack Obama's executive order that made open data the default setting of the federal government. According to Royal, the government has found "very few instances of agencies putting up data with sensitivities."

The action plan aggregates planned release schedules for agency data sets, including information on health, climate, small business and manufacturing opportunities, crime, education, and public domain information on the federal workforce.

While the government is taking steps to reduce the exposure of personally identifiable information or security threats, the lingering problem is that it is impossible to scope out all the potential future uses of government datasets in advance, said David E. McClure, who works on open data at the National Oceanic and Atmospheric Administration.

"We know there's undiscovered value and unrecognized threats," McClure said. "We need to have some way to manage it and the short answer is, I don't know how to."

Royal suggested that the model of preserving privacy by individual consent might be obsolete when so much data is passively captured by sensors, and the abundance of social media and search data collected by private companies makes anonymization "virtually impossible," he said: "Privacy as a concept is becoming less clear as technology increases and big data becomes more prevalent, and available."

About the Author

Adam Mazmanian is executive editor of FCW.

Before joining the editing team, Mazmanian was an FCW staff writer covering Congress, government-wide technology policy and the Department of Veterans Affairs. Prior to joining FCW, Mazmanian was technology correspondent for National Journal and served in a variety of editorial roles at B2B news service SmartBrief. Mazmanian has contributed reviews and articles to the Washington Post, the Washington City Paper, Newsday, New York Press, Architect Magazine and other publications.

Click here for previous articles by Mazmanian. Connect with him on Twitter at @thisismaz.


Featured

  • Telecommunications
    Stock photo ID: 658810513 By asharkyu

    GSA extends EIS deadline to 2023

    Agencies are getting up to three more years on existing telecom contracts before having to shift to the $50 billion Enterprise Infrastructure Solutions vehicle.

  • Workforce
    Shutterstock image ID: 569172169 By Zenzen

    OMB looks to retrain feds to fill cyber needs

    The federal government is taking steps to fill high-demand, skills-gap positions in tech by retraining employees already working within agencies without a cyber or IT background.

  • Acquisition
    GSA Headquarters (Photo by Rena Schild/Shutterstock)

    GSA to consolidate multiple award schedules

    The General Services Administration plans to consolidate dozens of its buying schedules across product areas including IT and services to reduce duplication.

Stay Connected

FCW Update

Sign up for our newsletter.

I agree to this site's Privacy Policy.