next up previous contents
Next: Running RCLASS Directly from Up: RCLASS - Classification Previous: RCLASS - Classification

The User Interface

The user interface for RCLASS is started by typing RCLASS & at the UNIX command prompt or by selecting ``RCLASS'' in the main menu. Afterwards, the window in Figure gif is shown.

  
Figure: Main window of RCLASS.


Files The program takes the name of three input files: table file, dictionary file and rule file. The first two are the same as those given to RGEN, and the rule file should be the output from RGEN. The result of the program is written to the output file, or standard out if no such is given. The output is equal to the input table file with a column add at the end, containing the classification of the system. An updated dictionary file (covering both the rule file and the new table file) is written with the name ``updated.dic''.
Method If there are conflicting rules for some of the objects, RCLASS uses the following methods to classify:

Selecting Rule Base When RGEN traverses its nodes, see Chapter gif, and creates the rules, the nodes are numbered in the order they are visited. Each node take a specific set of attributes into account when generating rules. RCLASS allows the user to choose which nodes in the lattice that are of interest, and this is done by using the window shown in Figure gif, which is displayed when ``Select rulebase no.'' is chosen. The nodes are listed in the order they were traversed, and their attribute numbers, starting from 0, is shown to the right of the check button.

  
Figure: The rulebase selection window.


Selecting Attributes In addition to the ``Select rulebase no. '' option, an option called ``Select attributes'' is available, but is not possible to use both these options at once. ``Select attributes'' lets you select which attributes may be taken into account, and which attributes must be taken into account when classifying. This corresponds to a sub lattice containing some of the nodes, and only rules created at these nodes will be used for classification.

  
Figure: The attribute selection window.

The example in Figure gif uses three attributes, numbered 0, 1 and 2. It is specified that the rules to use should come from nodes in the lattice which has no other attributes than that the attributes (0 1 2), but contains at least the attribute 0. The lattice for this example is shown in Figure gif. Rules from the four nodes (0 1 2), (0 1), (0 2) and (0) will be used for classification in this example. The thick lines and extra circles shown nodes in the sub-lattice from (0 1 2) to (0).

  
Figure: A lattice for three attributes.


Test Set this option if the data set contains a decision as the last column. RCLASS will then check its own decision against the correct decision, and create statistics. The output will contain RCLASS' decision, and not the original one. Some statistics, like the number of correctly classified objects, and the number of objects not recognized by any rule is printed.
Validity When ``validity'' is set, the program will add a final column to its output. This column will contain a number from 0 to 1000, and reflects the system's belief in its own decision, based on the accuracy for the rules used. The validity measure should not be taken literally, but rather as a hint to whether the classification is likely or not.
Full Format RCLASS prints its output by using the dictionary file to translate back to values from the real world if this option is set. Otherwise, the same values as in the table file are used.
Give Alternatives If ``give alternatives'' is set, the program is allowed to give several different possible values for the classification.
Ignore Blocks This option tells the program to ignore all blocks in the rule file.
Change Font By default the program uses a times roman font with size 12 points. This may be changed by setting one or two environment variables before starting the program:

RFONTSIZE The size of the font, in tenths. Defaults to 120.
RFONTDESCRIPTION A complete font description following the rules of the script language Tk. Defaults to *-times-medium-r-normal-*-120-*-*-*-*-*-*


next up previous contents
Next: Running RCLASS Directly from Up: RCLASS - Classification Previous: RCLASS - Classification

Helge Grenager Solheim
Sat May 4 03:30:02 MET DST 1996