PDA

View Full Version : How Google and Microsoft taught search to "understand" the Web


News
06-07-12, 06:50 AM
http://cdn.arstechnica.net/wp-content/uploads/2012/06/search-kings.jpg Photo illustration by Aurich Lawson


Despite the massive amounts of computing power dedicated by search engine companies to crawling and indexing trillions of documents on the Internet, search engines still can't do what nearly any human can: tell the difference between a star (http://www.solstation.com/stars/vega.htm), a 1970s TV show (http://www.imdb.com/title/tt0077093/),¬*and a Turkish alternative rock band (http://www.nr1music.com/vega/vega-info.html). That‚??s because Web indexing has been based on the bare words found on webpages, not on what they mean.

Since the beginning, search engines have essentially matched strings of text, says Shashi Thakur, a technical lead for Google‚??s search team. ‚??When you try to match strings, you don't get a sense of what those strings mean. We should have a connection to real-world knowledge of things and their properties and connections to other things.‚??

Making those connections is the reason for recent major changes within the search engines at Microsoft and Google. Microsoft‚??s Satori and Google‚??s Knowledge Graph both extract data from the unstructured information on webpages to create a structured database of the ‚??nouns‚?? of the Internet: people, places, things, and the relationships between them all. The changes aren't cosmetic; for Google, for example, this was the company's¬*biggest retooling to search since rolling out "universal search" in 2007.

Read more (http://arstechnica.com/information-technology/2012/06/inside-the-architecture-of-googles-knowledge-graph-and-microsofts-satori/) | Comments (http://arstechnica.com/information-technology/2012/06/inside-the-architecture-of-googles-knowledge-graph-and-microsofts-satori/?comments=1#comments-bar)



http://feeds.feedburner.com/~ff/arstechnica/index?i=lA9N1JjKeJQ:YXWstrUBrbk:V_sGLiPBpWU (http://feeds.arstechnica.com/~ff/arstechnica/index?a=lA9N1JjKeJQ:YXWstrUBrbk:V_sGLiPBpWU) http://feeds.feedburner.com/~ff/arstechnica/index?i=lA9N1JjKeJQ:YXWstrUBrbk:F7zBnMyn0Lo (http://feeds.arstechnica.com/~ff/arstechnica/index?a=lA9N1JjKeJQ:YXWstrUBrbk:F7zBnMyn0Lo) http://feeds.feedburner.com/~ff/arstechnica/index?d=qj6IDK7rITs (http://feeds.arstechnica.com/~ff/arstechnica/index?a=lA9N1JjKeJQ:YXWstrUBrbk:qj6IDK7rITs) http://feeds.feedburner.com/~ff/arstechnica/index?d=yIl2AUoC8zA (http://feeds.arstechnica.com/~ff/arstechnica/index?a=lA9N1JjKeJQ:YXWstrUBrbk:yIl2AUoC8zA)
http://feeds.feedburner.com/~r/arstechnica/index/~4/lA9N1JjKeJQ

More... (http://feeds.arstechnica.com/~r/arstechnica/index/~3/lA9N1JjKeJQ/)