Desktop search is the name for the emerging field of search tools which search the contents of a user's own computer files, rather than searching the Internet. The emphasis is on data mining all the information that is available on the user's PC, including web browser histories, e-mail archives, word-processor documents, and so on.
Technologies
The only way to achieve reasonable performance when searching several gigabytes of data is to build and maintain an index database. When indexing the files, desktop search tools collect three types of information about files:
- file and directory names
- meta data, such as titles, authors, comments
- content of supported documents.
To search within documents, the tools need to be able to parse many different types of document. This is achieved by using filters that interpret selected file formats. For example, Microsoft Office Filter is used to search inside MS Office documents.
Long-term goals for desktop search include the ability to search the contents of image files, sound files and video by context.
The sector has attracted considerable attention from the emerging struggle between Microsoft and Google. According to market analysts, both companies are attempting to leverage their monopolies (of web browsers and search engines, respectfully) to strengthen their dominance. This bring back memories of the browser wars of the 1990's.
Some of the players in this emerging search market segment are:
- HotBot - Lycos HotBot has an adware desktop search toolbar for IE.
- Copernic - Copernic Desktop Search. Indexes network drives.
- Google - Google Desktop Search. Integrates with the main Google search engine page.
- Autonomy - IDOL Enterprise Desktop Search.
- Microsoft - MSN Toolbar Suite beta incorporates much of the technology promised for Longhorn, the next version of Microsoft Windows. The search integrates into the task bar and Internet Explorer windows.
- Ask Jeeves
- Yahoo! - Yahoo's beta Desktop Search is based around X1.
- AOL - plans to release its own application based on Copernic technology.
- Apple - promised desktop technology for Tiger, the next generation of Mac OS X to be released in 2005. Apple intends to simplify creation of filters for application developers, allowing better indexing of document content.
- X1 - Expensive, but comprehensive, corporate tool.
External links