Moore’s Law and Web Search »
In 1965 then a co-founder of Fairchild Semiconductor, Gordon Earl Moore predicted that the number of silicon components on a single integrated circuit chip will double every 12 months. He later went on to start Intel with Robert Noyce and worked to make that prediction a reality.
True to his prediction the number of transistors on a microprocessor has been doubling at roughly 18 months for the past 45 years and this trend has come to be known as Moore’s law. It’s estimated, for example, that the semiconductor industry produced more transistors in 2005 than the number of grains of rice produced that year. For a deep look at the remarkable history of semiconductors explore this exhibit at the computer history museum web site.
Unlike Newtonian laws of motion or laws of electromagnetism, Moore’s Law is not one of nature but of human endeavor. In a symposium honoring the 40th anniversary of that prediction Moore himself stated that the law is based more on economics of the industry than on improvements in any particular material or process and has room to go on for another decade at least. In 2009, Intel CEO Craig Barrett said “We can scale it down another 10 to 15 years. Nothing touches the economics of it.” There has been ample speculation and analysis (like this one by author and blogger Andrew Curry) on when Moore’s law will end but another decade of it will surely lead to lot of creative destruction. Former chief editor of Wired magazine Kevin Kelly goes so far as to ask was Moore’s Law inevitable?
Fundamentally Moore’s law upends status quo by doing two things. In every cycle (18 months to 24 months) the cost of silicon components falls by a factor and the performance improves by a factor. Such an exponential decrease in costs and increase in performance offers ample opportunities for nimble and new players to outsmart the slow and old ones and win customers.
What are the implications of Moore’s Law for web search?
Web search as practiced by Google, Yahoo and Bing today essentially boils down to building this stack : crawling, indexing, ranking and presenting web sites as search results to a keyword search. Each layer of this stack is affected by Moore’s Law differently and offers different opportunities.
- Crawling : Crawling is inherently a massively parallel task. You maintain a huge distributed crawl queue and manage several processes pulling the data from the web based on that queue. Cheaper hardware and bandwidth lowers the cost of doing such a crawl every cycle. But unfortunately the amount of data on the web has also been growing exponentially. The web is said to have grown from 5 exa bytes of data in 2002 to 281 exa bytes of data today. One would need enormous cash reserves to fund a crawl that big despite cheapening hardware. Newcomers without access to such cash reserves are at a huge cost disadvantage.
- Indexing : Indexing is the task of organizing all the crawled data in ways that influence ranking and presenting. Modern day search engines build what is called a reverse index that maps keywords to websites they are present in. The process of indexing is a massive, resource intensive task, and newcomers are usually at a cost disadvantage, despite cheapening hardware.
- Ranking : Ranking is the task of retrieving relevant results and ordering them. The secret sauce of each search engine is said to reside here. It involves proprietary algorithms and is carefully controlled. Google is said to use about 200 signals that influence ranking. In some ways Moore’s Law does not directly affect this layer even though most of the analysis that one would need for ranking is done during the indexing phase. Fortunately offerings like Yahoo Boss give newcomers the chance to improve upon the ranking without building a huge crawl or index and the cost of such services are coming down with each cycle.
- Presenting : Presenting is the task of rendering search results in a manner that’s efficient, useful and easily consumable. Despite the heavy lifting that happens at the backend in the previous layers what a typical web search consumer sees is one text box and a list of ten blue links on a sparse search results page. Moore’s Law offers the biggest opportunity for newcomers here as consumers upgrade their personal computers each cycle. Studies show that users expect to see results to their keyword searches in half a second. And –with Moore’s law– what one can do in that half a second improves exponentially every cycle, both on the client side and on the server side. What was not possible to do in the last cycle suddenly becomes possible. With the rekindling of the browser wars tomorrow’s web browser will be vastly more powerful taking advantage of the next generation of multi core processors. They will offer much richer rendering and new forms of interaction. Newcomers to web search will have the best opportunity to disrupt the status quo by riding that wave. One need not limit the interface to keyword search. With the explosion of touch and voice interfaces in the last year or so, new ways of input will emerge that will make today’s keyword search seem archaic.
posted by Manyam Mallela






The internet is 


















Subscribe to our RSS Feed

