VanRoekel's open data strategy may have some limitations

Federal CIO Steven VanRoekel is a strong supporter of APIs for government data, but a transparency group urges a broader perspective.

While Federal Chief Information Officer Steven VanRoekel is a strong supporter of government-sponsored application programming interfaces, or APIs, to distribute federal data to the public, a major transparency group is suggesting that APIs alone can be limiting.

APIs are specifications that serve as interfaces between software. For example, Google Maps has an API that allows third-party websites to overlay restaurant reviews on top of Google Maps. APIs are what allow app developers to write apps that pull data from a database down to a user's smart phone or tablet computer. The API controls what information is accessible to the third-party sites or devices, and how it can be used. 

VanRoekel famously said, “Everything should be an API,” in April 2011, according to news reports, including a report by O'Reilly Radar. At that time VanRoekel was leading the redesign of the FCC.gov website in his previous role as FCC managing director.

He has continued to endorse APIs as a means for distributing government data from sites such as Data.gov. ”VanRoekel is vocal in his support of open data and the use of APIs,” wrote Justin Ellis in an article published March 13 by Nieman Journalism Lab. In the article, VanRoekel also spoke of APIs as providing a platform to users to experience government data in interesting ways, as an alternative to simply pushing out data through a 'dumb pipe.'  While he may not advocate APIs as the exclusive channel for providing data to the public, it's clearly his preferred course of first resort.

However, Sunlight Labs, an arm of the prominent transparency group Sunlight Foundation, suggests that APIs should be only one of the strategies for distributing government data -- and not necessarily the first choice. Sunlight did not connect their conclusions to VanRoekel specifically, but their thoughts do concern the technology VanRoekel champions.

APIs should be used to supplement, not displace, the availability of direct data downloads to the public, Eric Mill, Web and mobile developer for Sunlight, wrote in a March 21 blog entry on the Sunlight Labs Blog.

Mill presented several reasons why direct data downloads are still the best default strategy. For one, developing government APIs to distribute data might actually limit flexibility for users of the data, Mill wrote.

“There's no way to predict ahead of time the right data format and structure for every client who's interested in your data. Expect clients to need to transform your data for their own requirements, and for that transformation to require clients to first obtain all of your data,” Mill wrote.

In addition, creating APIs is time-consuming, expensive and difficult for government agencies, he added.

“Providing bulk access is several orders of magnitude less work on the part of the provider than building and maintaining an API,” Mill wrote in the blog. “An API is a system you need to design, create, keep running, attend to, and worry about. Bulk data access is uploading some files, forgetting about it, and letting [hyper text transfer protocol, or] HTTP do the work.”

Overall, government should take a nuanced view of APIs, Mill recommended. VanRoekel did not immediately respond to a request for comment.

“To sum up, Sunlight is pro-API - we make our own, and we welcome them from the government when they enhance access to information (the FederalRegister.gov API is a particularly good example),” Mill wrote. “However, the first step government should take, in nearly all cases, is to offer the data directly and in bulk.”