Big Data

DARPA wants better ways to search the web

Big Data

The Defense Advanced Research Projects Agency is looking for a new way for defense and other government agencies -- and eventually the public -- to search the ocean of Internet data more deeply.

The research agency said it wants ways to perform user-defined, domain-specific searches of public information, instead of the current commercial one-size-fits-all approach that uses the same set of search tools for all inquiries.

In a Feb. 9 statement, DARPA said that while that commercial approach has been wildly successful, it doesn't work as well for some government applications.

For example, commonly used search systems remain mostly manual processes that don't save sessions, require almost exact input with one-at-a-time entry, and don't organize or aggregate results beyond a list of links. They also miss information that could be embedded deeper in the web that aren't indexed by standard search engines, and skip over information that might be shared across web pages.

DARPA's statement said the initial application for its deep-web "Memex" program would enhance the Defense Department's efforts to combat human trafficking, a scourge that has a significant web presence. Using a more flexible search technology, DARPA contends the DoD could look deeper into online forums, chats, advertisements, job postings, hidden services and other out-of-the-way sites that enable the growing industry of modern slavery. An index curated for the counter-trafficking domain, along with configurable interfaces for search and analysis, would unlock new opportunities to uncover and derail trafficking enterprises, the DARPA statement said.

DARPA launched the Memex program looking to develop a next-generation search engine that will allow users to extend the reach of current search capabilities and quickly and thoroughly organize subsets of information based on individual interests. The program, the agency said, also aims to produce search results that are more immediately useful to specific domains and tasks, and to improve the ability of military, government and commercial enterprises to find and organize mission-critical publicly available information on the Internet.

"We're envisioning a new paradigm for search that would tailor indexed content, search results and interface tools to individual users and specific subject areas, and not the other way around," Chris White, DARPA program manager, said in the statement. "By inventing better methods for interacting with and sharing information, we want to improve search for everybody and individualize access to information. Ease of use for non-programmers is essential."

Through its solicitation, DARPA said, the program would explore three technical areas of interest: domain-specific indexing, domain-specific search, and DoD-specified applications.

DARPA noted that it is not interested in proposals for attributing anonymous services, deanonymizing or attributing identity to servers or IP addresses, or accessing information not intended to be publicly available. The agency said the program plans to use commodity hardware and emphasize creating and leveraging open source technology and architecture.

The agency plans a proposer's day on Feb 18 in Arlington, Va., to familiarize interested parties with the program.

Reader comments

Please post your comments here. Comments are moderated, so they may not appear immediately after submitting. We will not post comments that we consider abusive or off-topic.

Please type the letters/numbers you see above