Peter Mika – Making the Web searchable
Web Directions South 2011, Sydney, October 13th.
The key idea of the Semantic Web is to make information on the Web easily consumable by machines. As machines start to understand web pages as sources of data that can be easily combined with other public data on the Web, the promise is that search on the Web will move well beyond the current paradigm of retrieving pages by keywords. Instead, search engines will start to answer complex queries based on the cumulative knowledge of the Web.
In this presentation, we overview the basic set of technologies that can be used to annotate web pages so that they can be processed by data-aware search engines. In particular, we discuss the RDFa and microdata standards of the W3C designed for marking up data in HTML pages. We look at the ways in which this information is currently used by search engines, including the latest schema.org collaboration between Bing, Google, and Yahoo!, which provides a basic set of vocabulary items understood by all three major search engines on the Web.
About Peter Mika
Peter Mika is a researcher and data architect at Yahoo! Research in Barcelona, working on the applications of semantic technology to Web search. He received his BS in computer science from Eotvos Lorand University and his MSc and PhD in computer science (summa cum laude) from Vrije Universiteit Amsterdam. His interdisciplinary work in social networks and the Semantic Web earned him a Best Paper Award at the 2005 International Semantic Web Conference and a First Prize at the 2004 Semantic Web Challenge. From 2006 to 2009, he has been a co-chair of the Semantic Web Challenge. Mika is the youngest member elected to the editorial board of the Journal of Web Semantics. He is the author of the book ‘Social Networks and the Semantic Web’ (Springer, 2007). In 2008 he has been selected as one of “AI’s Ten to Watch” by the editorial board of the IEEE Intelligent Systems journal. Peter is a regular speaker at conferences.
Follow Peter on Twitter: @pmika