FOSE

The mosaic effect and big data


The proliferation of government data sets is providing developers with ample fodder for writing useful and potentially profitable applications around census, weather, health, energy, business, agricultural and other information. But as the government makes more and more data discoverable and machine readable, there is the threat that disparate threads can be pieced together in a way that yields information that is supposed to be private.

This kind of analysis through the combination of big data sets is called the mosaic effect. And it isn't necessarily bad, Marion Royal, director of Data.gov at the General Services Administration, said at a May 13 FOSE session. He noted, for example, that the combination of big data sets can supply clues on the paths of seasonal flu outbreaks. But there is also the potential for a bad guy to, say, use transportation data and energy production data to figure out where oil and gas are moving on trains and trucks.
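As a rough illustration of how such a mosaic can form, the Python sketch below joins two small data sets on the fields they share. Everything in it is an assumption made for illustration: the records are invented, and the field names, the `QUASI_IDENTIFIERS` list and the `link_records` helper are hypothetical and do not come from Data.gov or any agency. It shows one simple linkage technique, not a method described by the speakers.

```python
from collections import defaultdict

# Hypothetical, made-up records: a "de-identified" data set with no names...
health_records = [
    {"zip": "20001", "birth_year": 1961, "sex": "F", "diagnosis": "asthma"},
    {"zip": "20001", "birth_year": 1975, "sex": "M", "diagnosis": "diabetes"},
    {"zip": "20002", "birth_year": 1975, "sex": "M", "diagnosis": "flu"},
]

# ...and a separate public list that carries names but no health data.
public_roll = [
    {"name": "A. Example", "zip": "20001", "birth_year": 1961, "sex": "F"},
    {"name": "B. Sample", "zip": "20001", "birth_year": 1975, "sex": "M"},
    {"name": "C. Placeholder", "zip": "20002", "birth_year": 1975, "sex": "M"},
    {"name": "D. Standin", "zip": "20002", "birth_year": 1975, "sex": "M"},
]

# Fields that are harmless alone but can single out a person in combination.
QUASI_IDENTIFIERS = ("zip", "birth_year", "sex")


def link_records(anonymous, named, keys=QUASI_IDENTIFIERS):
    """Return (name, diagnosis) pairs where the shared fields match exactly one person."""
    by_key = defaultdict(list)
    for person in named:
        by_key[tuple(person[k] for k in keys)].append(person["name"])

    linked = []
    for record in anonymous:
        names = by_key.get(tuple(record[k] for k in keys), [])
        if len(names) == 1:  # unambiguous match: the record is re-identified
            linked.append((names[0], record["diagnosis"]))
    return linked


reidentified = link_records(health_records, public_roll)
for name, diagnosis in reidentified:
    print(name, "->", diagnosis)
```

In this toy case, two of the three nameless records match exactly one named person and are re-identified, while the third matches two people and stays ambiguous. Neither data set exposes anything sensitive by itself; the exposure appears only when they are combined.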

The White House publicly released its Open Data Action Plan on May 9, the one-year anniversary of President Barack Obama's executive order that made open data the default setting of the federal government. According to Royal, the government has found "very few instances of agencies putting up data with sensitivities."

The action plan aggregates planned release schedules for agency data sets, including information on health, climate, small business and manufacturing opportunities, crime, education, and public domain information on the federal workforce.

While the government is taking steps to limit the exposure of personally identifiable information and to reduce security threats, the lingering problem is that it is impossible to scope out all the potential future uses of government data sets in advance, said David E. McClure, who works on open data at the National Oceanic and Atmospheric Administration.

"We know there's undiscovered value and unrecognized threats," McClure said. "We need to have some way to manage it and the short answer is, I don't know how to."

Royal suggested that the model of preserving privacy by individual consent might be obsolete when so much data is passively captured by sensors, and that the abundance of social media and search data collected by private companies makes anonymization "virtually impossible." "Privacy as a concept is becoming less clear as technology increases and big data becomes more prevalent, and available," he said.

About the Author

Adam Mazmanian is executive editor of FCW.

Before joining the editing team, Mazmanian was an FCW staff writer covering Congress, government-wide technology policy and the Department of Veterans Affairs. Prior to joining FCW, Mazmanian was technology correspondent for National Journal and served in a variety of editorial roles at B2B news service SmartBrief. Mazmanian has contributed reviews and articles to the Washington Post, the Washington City Paper, Newsday, New York Press, Architect Magazine and other publications.

Connect with him on Twitter at @thisismaz.

