Soft Computing Techniques for Information Retrieval

Andreas Nürnberger¹
(Professor Lotfi A. Zadeh)
BTexact Technologies, UK and Berkeley Initiative in Soft Computing (BISC)

In this project, we study approaches that combine visualization techniques for document collections with conventional keyword and content-based search methods. During the past period, we focused on developing interactive visualization techniques.

For the visualization of document collections and search results, we developed models based on growing self-organizing maps. These maps can be used to arrange documents according to the similarity of their content [1]. The trained and labeled map can then be used to visualize the structure of the underlying document collection as well as changes in the collection, e.g., the insertion or removal of document subsets [2]. Furthermore, different coloring methods can convey visual information about document density, the match of search hits to specific document groups, and the similarity to a given sample document in content-based searching. Besides the analysis of text document collections, the developed models can also be applied to the analysis of multimedia data [3] or to the post-processing of search engine result sets. Visualizing and clustering the obtained results provides additional visual information about the outcome of a search, which is more intuitive than a plain ordered list of search hits.
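As a rough illustration of the map-based arrangement described above, the following sketch trains a small self-organizing map on a handful of toy documents and counts how many documents fall on each map unit (the kind of value a density coloring could display). It is a simplified sketch under stated assumptions, not the project's implementation: it uses a fixed-size map rather than the growing variant, plain term-frequency vectors, and the sample documents, function names (tf_vector, best_unit, train_som, hit_counts), and all parameter values are hypothetical.

```python
import math
import random


def tf_vector(text, vocab):
    """Length-normalized term-frequency vector over a fixed vocabulary."""
    words = text.lower().split()
    counts = [float(words.count(term)) for term in vocab]
    norm = math.sqrt(sum(c * c for c in counts)) or 1.0
    return [c / norm for c in counts]


def best_unit(weights, vec):
    """Index of the map unit whose weight vector is closest to vec."""
    def dist2(w):
        return sum((a - b) ** 2 for a, b in zip(w, vec))
    return min(range(len(weights)), key=lambda i: dist2(weights[i]))


def train_som(vectors, rows, cols, epochs=50, seed=0):
    """Train a fixed-size rectangular SOM (assumed schedule and parameters)."""
    rng = random.Random(seed)
    dim = len(vectors[0])
    weights = [[rng.random() for _ in range(dim)] for _ in range(rows * cols)]
    for epoch in range(epochs):
        frac = epoch / epochs
        rate = 0.5 * (1.0 - frac)                      # learning rate decays linearly
        radius = max(rows, cols) / 2.0 * (1.0 - frac) + 0.5  # shrinking neighborhood
        for vec in rng.sample(vectors, len(vectors)):  # one shuffled pass
            win_row, win_col = divmod(best_unit(weights, vec), cols)
            for i, w in enumerate(weights):
                r, c = divmod(i, cols)
                grid_d2 = (r - win_row) ** 2 + (c - win_col) ** 2
                h = math.exp(-grid_d2 / (2.0 * radius ** 2))
                for k in range(dim):
                    # Pull the unit toward the document, scaled by grid distance.
                    w[k] += rate * h * (vec[k] - w[k])
    return weights


def hit_counts(weights, vectors):
    """Number of documents assigned to each map unit."""
    counts = [0] * len(weights)
    for vec in vectors:
        counts[best_unit(weights, vec)] += 1
    return counts


# Hypothetical toy collection, for illustration only.
docs = [
    "neural network training",
    "network training neural network",
    "stock market prices",
    "market prices stock trading",
]
vocab = sorted({word for doc in docs for word in doc.split()})
vectors = [tf_vector(doc, vocab) for doc in docs]

ROWS, COLS = 2, 2
weights = train_som(vectors, ROWS, COLS)
counts = hit_counts(weights, vectors)

for row in range(ROWS):
    print(counts[row * COLS:(row + 1) * COLS])
```

The same best_unit lookup could be applied to a query or sample document to highlight the region of the map it matches; a growing map would additionally insert units where the quantization error stays high, which this sketch omits.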

A further advantage of the developed approaches is that they do not require manually defined lists of index terms or a classification hierarchy, which are usually subjectively labeled and expensive to maintain. Especially in rapidly changing document collections, such as collections of scientific research publications, classification systems that are not frequently updated and do not reflect the user’s classification criteria are usually not accepted.

[1] A. Nürnberger, A. Klose, and R. Kruse, "Self-Organising Maps for Interactive Search in Document Databases," Intelligent Exploration of the Web, eds. P. S. Szczepaniak, J. Segovia, J. Kacprzyk, and L. A. Zadeh, Physica-Verlag, 2002.
[2] A. Nürnberger and M. Detyniecki, "Visualizing Changes in Data Collections Using Growing Self-Organizing Maps," Proc. Int. Joint Conf. Neural Networks, Honolulu, HI, May 2002.
[3] A. Nürnberger and A. Klose, "Improving Clustering and Visualization of Multimedia Data Using Interactive User Feedback," Proc. Int. Conf. Information Processing and Management of Uncertainty in Knowledge-Based Systems, Annecy, France, July 2002.
¹ Postdoctoral Researcher

More information: http://nuernberger.webhop.org/irsc/

Contact the author: anuernb@eecs.berkeley.edu

