|06-07-12, 07:50 AM||#1|
Join Date: Jun 2009
How Google and Microsoft taught search to "understand" the Web
Photo illustration by Aurich Lawson
Despite the massive amounts of computing power dedicated by search engine companies to crawling and indexing trillions of documents on the Internet, search engines still can't do what nearly any human can: tell the difference between a star, a 1970s TV show,and a Turkish alternative rock band. That's because Web indexing has been based on the bare words found on webpages, not on what they mean.
Since the beginning, search engines have essentially matched strings of text, says Shashi Thakur, a technical lead for Google's search team. 'When you try to match strings, you don't get a sense of what those strings mean. We should have a connection to real-world knowledge of things and their properties and connections to other things.'
Making those connections is the reason for recent major changes within the search engines at Microsoft and Google. Microsoft's Satori and Google's Knowledge Graph both extract data from the unstructured information on webpages to create a structured database of the 'nouns' of the Internet: people, places, things, and the relationships between them all. The changes aren't cosmetic; for Google, for example, this was the company'sbiggest retooling to search since rolling out "universal search" in 2007.
Read more | Comments
|Thread||Thread Starter||Forum||Replies||Last Post|
|Maintain Your Privacy by Manually Accepting and Rejecting "Cookies" (nV News)||MikeC||Open Forum||2||02-02-13 08:15 PM|
|Microsoft Requests Takedowns From Google, But Content Remains on Bing||News||Archived News Items||0||05-25-12 10:30 PM|
|Google names names on copyright takedowns; Microsoft is #1||News||Archived News Items||1||05-24-12 04:42 PM|
|European regulators offer Google chance to settle antitrust violations||News||Archived News Items||0||05-21-12 11:10 AM|
|Google (partially) loses suit to Oracle over use of Java API's||ViN86||Mobile Devices And Smartphones||3||05-17-12 11:25 AM|