Michael Daconta: The semantic Web

Need help sorting through Web services? What is the semantic Web? Internet expert and researcher Michael Daconta, author of a book by the same title, answered questions Feb. 12 in a live, online forum. <a href="http://gcn.com/forum/qna_forum/24897-2.html">Transcript</a>

Need help sorting through Web services? What is the semantic Web? Internet expert and researcher Michael Daconta, author of a book by the same title, answered questions Feb. 12 in a GCN.com Forum.Daconta is director of Web and technology services for systems integrator APG McDonald Bradley Inc. of McLean, Va. As part of that job, he is chief architect for the Defense Intelligence Agency’s Virtual Knowledge Base, a project to compile a directory of Defense Department data through Extensible Markup Language ontologies.Daconta has written a number of technical papers and books. Most recently, he co-wrote the 2003 book The Semantic Web, along with Leo Obrst and Kevin Smith. The book is a primer on how XML, Web services and the emerging semantic Web fit together.Before working on the Virtual Knowledge Base, Daconta helped create a set of electronic mortgage standards for Fannie Mae. In the Army, Daconta worked as a programming section chief on combat and intelligence simulation software at Fort Huachuca, Ariz.Daconta received a bachelor’s degree in computer science from New York University and a master’s in computer science from Nova Southeastern University. Welcome. We'll start the forum in a few moments. Stand by.response_line/response_line response_line/response_line Blogging [Weblogging] has become really popular on the Internet. What can government agencies learn from the blogging community?response_line /response_line How persistent do you anticipate ontologies will be? Put a different way, will we end up with legacy ontologies as the real world that they were mapped to changes?response_line Good question Alec. We certainly will. That is why the OWL ontology specificationwhich JUST became a recommendation (see www.w3.org for the announcement) hasfeatures for versioning ontologies, stating that one is superceded by another and mapping between them./response_line What is the Departmnet of Defense Discover Metadata Standard? How is it different from the Dublin Core [a set of metadatastandards for business use]?response_line I am part of the team creating the DDMS XML Schema. DDMS is standard discovery metadata for DOD resources that is very similar to Dublin Core. In fact, we will be creating an XSLT stylesheet to convert from both DDMS to Dublin Core and vice versa. The key difference between DDMS and Dublin Core is that DDMS is more detailed than Dublin Core; however, we reused all the key concepts from Dublin Core./response_line response_line Well, I usually defer these types of security questions to my friend and co-author Kevin Smith. Kevin is our security expert. I can tell you that we are not just leveraging the security on the platform. Kevin has come up with some innovative techniques for Web services security. He has spoken at JavaOne and recently at the Net-centric warfare conference on these techniques./response_line Can you describe the split between RPC and SOAP? How are they different? Which one, in your opinion, is one better?response_line The difference between traditional RPC (which stands for Remote Procedure Call) and SOAP (which stands for Simple Object Access Protocol) is that RPC was a binary specification that allowed a client to call a procedure (also called a method or function) on a remote computer and retrieve the results from the function. SOAP allows that same functionality but using XML over H.T.T.P. Thus, SOAP gives you the same functionality (and a littlemore) with open standards. The genius of RPC is allowing programmers to perform network programming in the same way they write regular programs. The genius of SOAP was making that capability web-friendly, cross-platform and cross-programming environment (for example both .Net and J2EE)./response_line Can I download code that you wrote that demonstrates this concept in a 'real world' application.response_line Yes. There are many real-word examples. You can go to the "mangrove" project at Washington University for some good applications. Also, the company Applied Semantics has good real-world applications. So good, in fact, that the company was purchased by Google. On my web site (www.daconta.net) I have some ontologies and the Wordnet ontology and Web services will also be posted there under the Projects folder./response_line How doable is it, really, to parse an infinitely flexible language like English as you describe? What languages would be easier?response_line Parsing English is not very difficult at all. It is easy to break up an english sentence into words. It is a little harder but possible to then break the words down into parts of speech. We do this now for our government projects and there are several open source packages on the internet that also do it. Additionally, Google has stated that its next major enhancement for its search engine is natural language question answering. As for other languages, there are certainly other ways to receive information on what user a wants to discover--everything from Structured Query Language to graphical browsing./response_line What do you believe will be the most important career skill of the next 20 years?response_line Boy that is a good question especially in regards to the current offshoring problem. From my own career experience, I would say that "innovation" and "motivation" top the list. Especially innovation with an aggressive attitude towards solving problems./response_line In our XML development, it is difficult to agree on tags. How difficult is it to create what you describe as smart data?response_line /response_line response_line Excellent question. Part of the answer lies in the fact that you cannot solve the problem with only half of the metadata equation. In other words, a server side search engine has to know something about the user in order to provide the user with relevant results. In fact, a good definition of relevance is the intersection between the user context and the data source. This fundamental mismatch is evidenced when search engine companies try to take a probabilistic query approach (a few key words that "probably" refer to what you want) and marry it up with a deterministic search space (associative search engines as you discuss). The flip side of the coin can be seen in the failure of "Ask Jeeves" that tried to marry up a deterministic query (like "How many people live in Virginia?") with a probabilistic search space (traditional keyword matching). Again, won't work. So the solution is to marry up determistic models of both the user context and the user queries with a deterministic search space. This is the approach that the modern search engines are now aggressively exploring. Also, the time for us to get this right is running short. Once we have location-aware cell phones and voice-recognition, the requirements for "real-time" relevance will increase dramatically./response_line How is [semantic Web] different from what the [Object Management Group] calls model-driven architecture?response_line They are similar efforts that are working together on certain aspects of data modeling. The semantic Web technologies are being driven by the W3C while the MDA is driven by the OMG. The roots of the semantic Web are the Web while the roots of MDA are the Unified Modeling language (UML). The OMG is working on a UML profile for OWL. This will be a major milestone in the adoption of these technologiess as there are many programmers familiar with UML. So, bridging these technologies is a major step in the right direction./response_line What government agencies are taking the lead in evaluating semantic Web technologies? When do you expect we will begin to see production government applications using semantic Web technology?response_line I work with many of the agencies moving out aggressively in this space. I would say that in my opinion: DIA, OSD, DISA, the Army and EPA are being the most forward-thinking. However, those are just the organizations that I personally know are involved in these technologies. I am sure there are many I do not know about. The horizontall fusion program is spearheading many semantic web technologies and are proving the concepts to work in improving information sharing and interoperability./response_line What platform is the Virtual Knowledge Base being implemented on?response_line The primary platform for server side of the Virtual Knowledge base is Sun hardware and the Solaris operating system./response_line Debugging has to be a challenging process in a full-fledged smart data "product" like the one you have proposed for the application of Semantic Web technology to network-centric warfare. How, for example, do you test for completeness?response_line While debugging is always a challenging process, it is actually the same for a semantic Web product as for any large enterprise system. However, I believe the process will actually get simpler by adopting semantic Web technologies because they rely upon "declarative languages" like XML. The more software components exchange human-readable XML at the interfaces, the easier debugging will be because MOST errors occur at the interfaces./response_line Well, that will wrap up this forum Q&A. Thanks again, Mr. Daconta, for joining us today.response_line/response_line/row
View transcript












Read Daconta's interview with GCN Associate editor Joab Jackson

GCN.com:





Joab Jackson:

Welcome to today's forum. Today we have Michael C. Daconta, an author of The Semantic Web, a recently published book by Wiley Publishing.



We have found Mr. Daconta to be extremely helpful in making sense of the soup of acronyms that is Web services and the Semantic Web. Daconta is also chief architect for the Defense Intelligence Agency’s Virtual Knowledge Base, a project to compile a directory of Defense Department data through Extensible Markup Language ontologies. He is chief director of Web and technology services for systems integrator APG McDonald Bradley Inc. of McLean, Va.



Thanks for logging in today and welcome Mr. Daconta.








Joab Jackson:

Michael Daconta:

Agencies should learn that decentralized sharing of data is the best way to go. Things are too dynamic and fluid to attempt to centralize data sharing.



The Intelink Management Office [an interagency office that oversees top-secret, secret and sensitive but unclassified intranets for intelligence organizations] is rapidly adopting RSS for implementing
publish/subscribe of site feeds/changes for the Intelligence community.
Other agencies should rapidly follow suit.



One thing *NOT* to learn from the blogging community is the tremendous
amount of standards "thrashing" that has gone on with RSS. Our project
uses the RDF Site Summary which is RSS 1.0 because we want a
seamless migration path to have our RSS feeds references OWL ontologies. [Editor's note: OWL is the Web Ontology Language. RDF is the Resource Description Framework] RDF and OWL have just become W3C recommendations. See
http://www.w3.org for details.








Alec in DC:

Michael Daconta:








Wash DC:

Michael Daconta:





Larry McCay - Philadelphia, PA:

What if any aspects of security do you see as being within the scope of the Virtual Knowledge Base and how do you ensure the integrity of the knowledge as it is accessed across the network?



Do you plan on leveraging only the security mechansims provided via the chosen platform?




Michael Daconta:





Joab Jackson:

Michael Daconta:







Randy Spears, Bethesda MD:

Michael Daconta:





Washington:

Michael Daconta:





Susan, Bethesda:

Michael Daconta:





Dave in VA:

Michael Daconta:

Representing knowledge correctly is not easy; however, as semantic web tools mature and as we get more good examples -- it will become easier. This year, I will be participating on the W3C Semantic Web Activities Best Practices group. Our job will be to come up with advice, guidelines and patterns for creating data using RDF and OWL.



One more thing, even the move from unstructured data (like HTML) to semi-structured data like XML can be difficult. I highly recommend that agencies get a senior team of XML experts and functional experts to hammer out a draft standard BEFORE opening it up to a large group. Why? Because you need experienced people to easily settle the basic debate questions that stall large groups. This method has worked for customers who I have supported to develop mortgage standards, military markup and now NCES standards.








Nampa, Idaho:

Many IT professionals working to bring to market associative search engines find themselves stymied, even baffled, by how difficult it is to program a search engine that is an almost exact match with how users view the data and their needs. One group created preset keyword queries that, when shown to the customer, were almost a 100% mismatch from how the user defined the keyword.



So, what is the realistic chance that search will one day be as easy and successful as asking Rover to fetch the eyeglasses case that only you and he know how to find?




Michael Daconta:






Joe B, Washington DC:

Michael Daconta:





Ed , Maryland:

Michael Daconta:





Larry McCay - Philadelphia, PA:

Michael Daconta:





Alec in DC:

Michael Daconta:





Joab Jackson:




X
This website uses cookies to enhance user experience and to analyze performance and traffic on our website. We also share information about your use of our site with our social media, advertising and analytics partners. Learn More / Do Not Sell My Personal Information
Accept Cookies
X
Cookie Preferences Cookie List

Do Not Sell My Personal Information

When you visit our website, we store cookies on your browser to collect information. The information collected might relate to you, your preferences or your device, and is mostly used to make the site work as you expect it to and to provide a more personalized web experience. However, you can choose not to allow certain types of cookies, which may impact your experience of the site and the services we are able to offer. Click on the different category headings to find out more and change our default settings according to your preference. You cannot opt-out of our First Party Strictly Necessary Cookies as they are deployed in order to ensure the proper functioning of our website (such as prompting the cookie banner and remembering your settings, to log into your account, to redirect you when you log out, etc.). For more information about the First and Third Party Cookies used please follow this link.

Allow All Cookies

Manage Consent Preferences

Strictly Necessary Cookies - Always Active

We do not allow you to opt-out of our certain cookies, as they are necessary to ensure the proper functioning of our website (such as prompting our cookie banner and remembering your privacy choices) and/or to monitor site performance. These cookies are not used in a way that constitutes a “sale” of your data under the CCPA. You can set your browser to block or alert you about these cookies, but some parts of the site will not work as intended if you do so. You can usually find these settings in the Options or Preferences menu of your browser. Visit www.allaboutcookies.org to learn more.

Sale of Personal Data, Targeting & Social Media Cookies

Under the California Consumer Privacy Act, you have the right to opt-out of the sale of your personal information to third parties. These cookies collect information for analytics and to personalize your experience with targeted ads. You may exercise your right to opt out of the sale of personal information by using this toggle switch. If you opt out we will not be able to offer you personalised ads and will not hand over your personal information to any third parties. Additionally, you may contact our legal department for further clarification about your rights as a California consumer by using this Exercise My Rights link

If you have enabled privacy controls on your browser (such as a plugin), we have to take that as a valid request to opt-out. Therefore we would not be able to track your activity through the web. This may affect our ability to personalize ads according to your preferences.

Targeting cookies may be set through our site by our advertising partners. They may be used by those companies to build a profile of your interests and show you relevant adverts on other sites. They do not store directly personal information, but are based on uniquely identifying your browser and internet device. If you do not allow these cookies, you will experience less targeted advertising.

Social media cookies are set by a range of social media services that we have added to the site to enable you to share our content with your friends and networks. They are capable of tracking your browser across other sites and building up a profile of your interests. This may impact the content and messages you see on other websites you visit. If you do not allow these cookies you may not be able to use or see these sharing tools.

If you want to opt out of all of our lead reports and lists, please submit a privacy request at our Do Not Sell page.

Save Settings
Cookie Preferences Cookie List

Cookie List

A cookie is a small piece of data (text file) that a website – when visited by a user – asks your browser to store on your device in order to remember information about you, such as your language preference or login information. Those cookies are set by us and called first-party cookies. We also use third-party cookies – which are cookies from a domain different than the domain of the website you are visiting – for our advertising and marketing efforts. More specifically, we use cookies and other tracking technologies for the following purposes:

Strictly Necessary Cookies

We do not allow you to opt-out of our certain cookies, as they are necessary to ensure the proper functioning of our website (such as prompting our cookie banner and remembering your privacy choices) and/or to monitor site performance. These cookies are not used in a way that constitutes a “sale” of your data under the CCPA. You can set your browser to block or alert you about these cookies, but some parts of the site will not work as intended if you do so. You can usually find these settings in the Options or Preferences menu of your browser. Visit www.allaboutcookies.org to learn more.

Functional Cookies

We do not allow you to opt-out of our certain cookies, as they are necessary to ensure the proper functioning of our website (such as prompting our cookie banner and remembering your privacy choices) and/or to monitor site performance. These cookies are not used in a way that constitutes a “sale” of your data under the CCPA. You can set your browser to block or alert you about these cookies, but some parts of the site will not work as intended if you do so. You can usually find these settings in the Options or Preferences menu of your browser. Visit www.allaboutcookies.org to learn more.

Performance Cookies

We do not allow you to opt-out of our certain cookies, as they are necessary to ensure the proper functioning of our website (such as prompting our cookie banner and remembering your privacy choices) and/or to monitor site performance. These cookies are not used in a way that constitutes a “sale” of your data under the CCPA. You can set your browser to block or alert you about these cookies, but some parts of the site will not work as intended if you do so. You can usually find these settings in the Options or Preferences menu of your browser. Visit www.allaboutcookies.org to learn more.

Sale of Personal Data

We also use cookies to personalize your experience on our websites, including by determining the most relevant content and advertisements to show you, and to monitor site traffic and performance, so that we may improve our websites and your experience. You may opt out of our use of such cookies (and the associated “sale” of your Personal Information) by using this toggle switch. You will still see some advertising, regardless of your selection. Because we do not track you across different devices, browsers and GEMG properties, your selection will take effect only on this browser, this device and this website.

Social Media Cookies

We also use cookies to personalize your experience on our websites, including by determining the most relevant content and advertisements to show you, and to monitor site traffic and performance, so that we may improve our websites and your experience. You may opt out of our use of such cookies (and the associated “sale” of your Personal Information) by using this toggle switch. You will still see some advertising, regardless of your selection. Because we do not track you across different devices, browsers and GEMG properties, your selection will take effect only on this browser, this device and this website.

Targeting Cookies

We also use cookies to personalize your experience on our websites, including by determining the most relevant content and advertisements to show you, and to monitor site traffic and performance, so that we may improve our websites and your experience. You may opt out of our use of such cookies (and the associated “sale” of your Personal Information) by using this toggle switch. You will still see some advertising, regardless of your selection. Because we do not track you across different devices, browsers and GEMG properties, your selection will take effect only on this browser, this device and this website.