CLUSTER: An Approach to Contextual Language Understanding

Yigal Arens

EECS Department
University of California, Berkeley
Technical Report No. UCB/CSD-86-293
April 1986

http://www2.eecs.berkeley.edu/Pubs/TechRpts/1986/CSD-86-293.pdf

Understanding natural language in context requires the existence of a model of the surrounding world. Such a model must include representations for the objects and activities present in the language being processed, in the surrounding physical environment, and in relevant past experiences of the understander.

The CLUSTER theory of contextual language understanding addresses this issue. CLUSTER has two main components: a theory of modeling the context of an ongoing conversation, and a theory of language analysis.

The first component of CLUSTER theory concerns the model of the world which an understanding system has: the objects and processes in the world that are to be represented, and the relation of these to the understanding system itself. The model emphasizes those elements that are more significant in the current situation, while ignoring information that is relevant. This thesis characterizes a mechanism for maintaining a model of the world, the Context Modeler, and the model of the world maintained by it, called the Context Model. The Context Modeler constructs the Context Model during the course of interacting with another language user.

The second component of CLUSTER theory concerns language analysis, the production of a representation of the meaning of a given sentence. Natural language analysis provides the basis for constructing the Context Model. The Context Model, in turn, is a major resource used by the language analyzer. This thesis shows the centrality of the role played by a model of the context to a system's ability to understand natural language input.

In addition to characterizing these mechanisms, this thesis also describes particular implementations of CLUSTER. The implementation of CLUSTER's language analysis component is named PHRAN, for PHRasal ANanalyzer. The implementation of CLUSTER's context modeling component is referred to as The Context Modeler. Both components are combined in the UNIX Consultant (UC), a natural language help facility that allows new users of the UNIX operating system to learn about UNIX. Users do so by holding a natural language dialogue with UC.


BibTeX citation:

@techreport{Arens:CSD-86-293,
    Author = {Arens, Yigal},
    Title = {CLUSTER: An Approach to Contextual Language Understanding},
    Institution = {EECS Department, University of California, Berkeley},
    Year = {1986},
    Month = {Apr},
    URL = {http://www2.eecs.berkeley.edu/Pubs/TechRpts/1986/6109.html},
    Number = {UCB/CSD-86-293},
    Abstract = {Understanding natural language in context requires the existence of a model of the surrounding world. Such a model must include representations for the objects and activities present in the language being processed, in the surrounding physical environment, and in relevant past experiences of the understander. <p> The CLUSTER theory of contextual language understanding addresses this issue. CLUSTER has two main components: a theory of modeling the context of an ongoing conversation, and a theory of language analysis. <p> The first component of CLUSTER theory concerns the model of the world which an understanding system has: the objects and processes in the world that are to be represented, and the relation of these to the understanding system itself. The model emphasizes those elements that are more significant in the current situation, while ignoring information that is relevant. This thesis characterizes a mechanism for maintaining a model of the world, the Context Modeler, and the model of the world maintained by it, called the Context Model. The Context Modeler constructs the Context Model during the course of interacting with another language user. <p> The second component of CLUSTER theory concerns language analysis, the production of a representation of the meaning of a given sentence. Natural language analysis provides the basis for constructing the Context Model. The Context Model, in turn, is a major resource used by the language analyzer. This thesis shows the centrality of the role played by a model of the context to a system's ability to understand natural language input. <p> In addition to characterizing these mechanisms, this thesis also describes particular implementations of CLUSTER. The implementation of CLUSTER's language analysis component is named PHRAN, for PHRasal ANanalyzer. The implementation of CLUSTER's context modeling component is referred to as The Context Modeler. Both components are combined in the UNIX Consultant (UC), a natural language help facility that allows new users of the UNIX operating system to learn about UNIX. Users do so by holding a natural language dialogue with UC.}
}

EndNote citation:

%0 Report
%A Arens, Yigal
%T CLUSTER: An Approach to Contextual Language Understanding
%I EECS Department, University of California, Berkeley
%D 1986
%@ UCB/CSD-86-293
%U http://www2.eecs.berkeley.edu/Pubs/TechRpts/1986/6109.html
%F Arens:CSD-86-293