|
|
3.3.2 General Results
After training, the training corpus and two test corpora were presented to the network again. (The test corpora also consisted of 4,000 sentences with lengths of up to 20 words.) This time, however, no modifications were made to the weights on the connections -- only the activation values of the 17 output units for each input pattern were saved so that the actual output of the network could be compared to the desired output. The three resulting files (one for each corpus) were 16 megabytes long and full of floating-point numbers. A Perl program was then used to process the files: a two-page summary of the error percentages for each of the output units was created, and the results files were rewritten to show the actual words of each input pattern, rather than the 85-bits long representations of these words. Figure 3.8 shows a sample session of the use of the program.
shivan:~/zin$ restore.input +-+-+ Starting CLASPnet's SNNS English Results Analyzer +-+-+ +-+-+ Copyright 1995-1996. Ezra Van Everbroeck +-+-+ Analyze which results file? demo.res Use which vocabulary file [demo.voc]? Name for RRF file (will overwrite) [demo.rrf]? ++ Writing RRF file: demo.rrf Name for error summary file (will overwrite) [demo.err]? ++ Writing error summary file: demo.err ++ Done. shivan:~/zin$ |
Before I provide more detailed results, it is worth comparing the overall performance of the network on the training corpus with how well it did on the test corpora. Figure 3.9 shows this comparison for various tolerance values. (These tolerance values are similar to the ones discussed above: if the actual activation value of an output unit was 0.85, and the desired output value was 1, then it would be considered to be an error for tolerances 0 and 0.1; with a tolerance of 0.2 an activation value of 0.8 instead of 1 suffices, so the output of the net would be considered correct. A tolerance of 0 is particularly harsh because networks use floating point numbers: even 0.99999 would not be considered correct if the target were 1. Usually, tolerance values of 0.2 and 0.3 are considered acceptable.) The precise numbers of these results can be found in Appendix 5.
There are three general observations I would like to make about Figure 3.9:
| Word | Ind | Imp | Int | ID1 | ID2 | ID3 | Sta | Dec | Ord | WH | YN | Rel | Con | Com | Inf | Voi | Pol |
| the | 1 | 0 | 0 | 1 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 |
| 1 | 0.04 | 0 | 0.98 | 0.01 | 0.01 | 1 | 0.92 | 0.01 | 0 | 0 | 0 | 0.09 | 0.01 | 0 | 0.54 | 0.60 | |
| child | 1 | 0 | 0 | 1 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 |
| 0.98 | 0.01 | 0 | 1 | 0 | 0.01 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.61 | 0.62 | |
| proclaim | 1 | 0 | 0 | 1 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 |
| 0.99 | 0 | 0 | 1 | 0 | 0.03 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.99 | |
| which | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 1 |
| 1 | 0 | 0 | 0.01 | 1 | 0 | 0 | 0.01 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0.81 | 0.81 | |
| pilot | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 1 |
| 1 | 0 | 0 | 0 | 1 | 0.01 | 0 | 0 | 0 | 0.96 | 0 | 0 | 0 | 0.01 | 0 | 0.62 | 0.49 | |
| the | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 1 |
| 1 | 0 | 0 | 0 | 1 | 0.01 | 0 | 0 | 0 | 0.90 | 0 | 0.06 | 0 | 0 | 0 | 0.87 | 0.45 | |
| great | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 1 |
| 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0.94 | 0 | 0.05 | 0 | 0 | 0 | 0.90 | 0.78 | |
| horses | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 1 |
| 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0.02 | 0 | 0.97 | 0 | 0 | 0 | 0 | 0 | 0.98 | 0.86 | |
| , | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| which | 1 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 |
| 1 | 0 | 0.02 | 1 | 1 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0.98 | 0 | 0 | 0 | 0.28 | 0.11 | |
| can | 1 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 |
| 0.99 | 0 | 0 | 1 | 1 | 0.02 | 0 | 0 | 0 | 0 | 0 | 0.99 | 0 | 0 | 0 | 0.81 | 0.03 | |
| never | 1 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 |
| 0.96 | 0 | 0 | 1 | 0.98 | 0.01 | 0.02 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0.99 | 0 | |
| eat | 1 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 |
| 1 | 0 | 0 | 1 | 1 | 0 | 0.01 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | |
| , | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| can | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 1 |
| 1 | 0 | 0 | 0.12 | 0.91 | 0.01 | 0.36 | 0.55 | 0 | 0.02 | 0.01 | 0.05 | 0.06 | 0.01 | 0.03 | 0.89 | 0.38 | |
| see | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 1 |
| 1 | 0 | 0 | 0.06 | 0.94 | 0.02 | 0.89 | 0.13 | 0 | 0.01 | 0.02 | 0 | 0.04 | 0.17 | 0.01 | 1 | 0.95 | |
| . | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
Let's focus on each group of output units in turn: