Why the National Archives needs punch-card readers

Scruffy Millennials covet old record players because they dig the format; the National Archives and Records Administration keeps old file players around because legacy digital data demand them.

"I am preserving every file format that has ever existed on the web, or that any of you have ever used in your work on a daily basis," said Leslie Johnston, NARA's director of digital preservation, who spoke at a March 10 FedScoop event. "In one transfer from one agency, we received not only their email, their Word documents, their PDFs, their PowerPoints -- we actually received the entire contents of their hard drives."

NARA faces a problem of sheer scale, Johnston said, as it will need to manage 500 petabytes of data by 2020.

But diverse file formats are a challenge all their own.

"If our records are not accessible, then they have not been properly preserved and managed," Johnston noted. The Obama administration has pushed for electronic record-keeping wherever possible, but Johnston pointed to NARA's dual mandate: to offer records in modern, accessible formats and to maintain the original, "authentic" file formats.

The agency gets requests for data in all manner of formats – Johnston said she'd recently received a request for data on punch cards – and sometimes receives records in surprisingly outdated formats.

NARA must be able to read and process the information, so the agency maintains a stable of "vintage media readers" that include various disc and tape players.

For NARA, the management struggle will be constant in coming years, said Brian Houston, engineering VP at Hitachi Data Systems' federal outfit.

Houston said Hitachi has been working with NARA on versioning files, so that a record can be linked to both its modern and its original formats instead of having to be copied into a completely separate file.

The many-formats, many-readers problem isn't going away, Houston acknowledged, but the private sector may see an opportunity: thanks to federal record-keeping, there will always be a market for CD players and the like.

"I'm sure there's somebody in the industry who'd love to be able to have that niche," Houston said.

About the Author

Zach Noble is a staff writer covering digital citizen services, workforce issues and a range of civilian federal agencies.

Before joining FCW in 2015, Noble served as assistant editor at the viral news site TheBlaze, where he wrote a mix of business, political and breaking news stories and managed weekend news coverage. He has also written for online and print publications including The Washington Free Beacon, The Santa Barbara News-Press, The Federalist and Washington Technology.

Noble is a graduate of Saint Vincent College, where he studied English, economics and mathematics.


