|
ABSTRACT
This study investigated various techniques for systematically abbreviating English words and names. Most of the attention was given to the techniques which could be mechanized with a digital device such as a general purpose digital computer. Particular attention was paid to techniques that could process incoming information without prior knowledge of its existence (i.e., no table lookups). Thirteen basic techniques and their modifications are described. In addition, most of the techniques were tested on a sample of several thousand subject words and several thousand proper names in order to provide a quantitative measure of comparison.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
 |
2
|
|
| |
3
|
BLAIR, C R. A program for correcting spelhng errors Inf. Contr. 3 (1960), 60-67.
|
| |
4
|
BLOOMER, J G. Bloomer's Commerczal Cryptograph--A Teletype and Double Index- Holocryptic Cipher. (A. Roman & Co., San Francisco, Calif, 1874).
|
| |
5
|
BURTON, N. G AND LICKLIDER, J. C .R. Long-range constraints in the statistical structure of printed English. Am. J. PzychoI 68 (1955), (550-653.
|
| |
6
|
CHAPANIS, A. The reconstruction of abbrewatcd printed messages J. Expemmental Psychol. 48 (1954).
|
| |
7
|
Cox, G. J., CASEY, R. S. AND BAILEY, G.F. Recent developments in keysort cards. J. Chem Educ. 24 (1947), 65-70.
|
| |
8
|
EVANS, M. W., McELWIAN, C. K. AND YAN HOOSEN, F. Machine correction of galbled
|
| |
9
|
FRISHBERG, M. Several techniques for obtaining 60% go 150% efficiency improvements in storage aud retrieval systems usiug general purpose computers Paper presented at the 15th National Conference of the ACM, Milwaukee, AugusL, 1960
|
| |
10
|
FRUMKINA, R.M. Some procedural problems in compihng frequency dictionaries (statistical structure of dictionary and text) A translation of an article which appeared :n the Russisn-language periodical Mashznnye Petered Prikladaaya Lingvistika (Machine Translation and Applied Linguistics), No. 2(9), Moscow (1959). Translation available from Office of Technical Services, U. S. Dept. of Commerce, Document No. JPRS: 3599 (26 August 1900)
|
| |
11
|
GAINES, H. F Cvyptanalysis. (Dover Publications, Inc., New York, 1956).
|
| |
12
|
GRIFFITH, R.T. The Minimotion typewriter keyboard. J. Frankltn Inst. 248, (1949), 399-436.
|
 |
13
|
|
| |
14
|
LU HN. P Superimposed coding with the aid of randomizing squares for use in mechnnical information searching systems In CASEY, PERRY, KENT and BERRY, Punched Cards-Their Application to Science and Industry, 2d ed., cb. 23, Reinhold Pub. Corp., New York, 1958.
|
| |
15
|
MANDELBROT, B. Simple games of strategy occurring in communication through ha. rural languages. IRE Trans. PGIT-3 (1954), 124-137.
|
| |
16
|
MILLER, G. A Some effects of intermittent silence, Am J. Psychol. 62 (t957), 311-313.
|
| |
17
|
MILLER, G A. AND FRIEDMAN, E.A. The reconstruction of mutilated English texts. i nf. Contr 1, (1957), 38-55.
|
| |
18
|
MILLER, G. A , NNEWMAN, E. B. AND FRIEDMAN, E.A. Length-frequency statistics for written English, Inf. Contr 1, (1958), 370-389 (This research was conducted under contract AF (33(038)-14343, and appears as ASTIA Report No. AD-160 709.)
|
| |
19
|
MILLER, G. A. AND NEWMAN, E. B. Tests of a statistical explanation of the rankfrequency feint:on for words m written F, ngl:sh, Am. J Psychol. 63 (1958), 209-218.
|
| |
20
|
NEWCOMBE, H. B , KENNEDY, J. M., AXFORD, S J .~ND JAMES, A. ~. Automatic linkage of vital records, Science 130 (t959), 954-959.
|
| |
21
|
NEWMAN, E.B. The Pattern of vowels and consonants in various languages, Am. J. Psychol 64, (1951), 369-379.
|
| |
22
|
NEWMAN, E. B. AND GERSTMAN, L.S. A new method for analyzing printed English, J. Experimental Psychol 44, (1952), 114-125
|
| |
23
|
NEWMAN, E. B AND WAUGH, N. C The redundancy of texts in three languages. Inf. Contr. 3 (1960), 141-153.
|
| |
24
|
OETTINGER, A. G. The distribution of word lengths in technica,1 Russmn Mech. Translation 1, (1954).
|
 |
25
|
|
| |
26
|
OHAVER, M E. Cryptogram Solving. (Stoneman Press, Columbus, Ohio, 1933).
|
| |
27
|
OHLMAN, H. Subject word letter frequencies with applications to superimposed coding. Proceedings lnternaionl Conference on Scientific information, Washington, D. C. (November 1958).
|
| |
28
|
PRATT, F. Secret and Urgent, The Story of Codes and Ciphers. (Blue Ribbon Books, Garden City, N. Y., 1942).
|
| |
29
|
REMINGTON-RAND. Soundex--foolproof filing system for finding any name in the file. Brochure LBVS09 (undated).
|
| |
30
|
REMINGTON-RAND. Idem sonans says it's legal (Soundex). Brochure LBV528 (undated).
|
| |
31
|
SHANNON, C.E. Pre&ction and entropy of printed English. Bell System Tech J. 80, (1951), 5O-64.
|
| |
32
|
SMITH, L.D. Cryptography-The Science of Secret Writing. (W. W. Norton Co , Inc., New York, 1943).
|
| |
33
|
TAUTON. B. W. Name code--a method of filing accounts alphabetmalty on a computer. Data Proc 2, (March 1960), 23-24.
|
| |
34
|
WEST, M. A General Servzce L~st of English Words wzth Semantic Frequencies (Longroans, Green, and Co., New York, 1953).
|
| |
35
|
YNGVE, V. H. Gap analysis and syntax, IRE Trans. inf. Theory IT-2, (1956), 106-112
|
| |
36
|
ZIPF, G. K The Psychobiology of Language. (Houghton Mifltm Co., Boston, 1935).
|
| |
37
|
ZIPF, G.K. ltuman Behawor and the Principle of Least Effort. (Addison-Wesley Publishing Co., inc., Cambridge, Mass., 1949).
|
| |
38
|
BOURNE, C. P AND FOND, D.F. A study of the statistics of letters in English words. Inf. Contr. 4, 1 (Mar. 1961), 48-67.
|
Peer to Peer - Readers of this Article have also read:
-
Data structures for quadtree approximation and compression
Communications of the ACM
28, 9
Hanan Samet
-
A hierarchical single-key-lock access control using the Chinese remainder theorem
Proceedings of the 1992 ACM/SIGAPP Symposium on Applied computing
Kim S. Lee
, Huizhu Lu
, D. D. Fisher
-
The GemStone object database management system
Communications of the ACM
34, 10
Paul Butterworth
, Allen Otis
, Jacob Stein
-
Putting innovation to work: adoption strategies for multimedia communication systems
Communications of the ACM
34, 12
Ellen Francik
, Susan Ehrlich Rudman
, Donna Cooper
, Stephen Levine
-
An intelligent component database for behavioral synthesis
Proceedings of the 27th ACM/IEEE Design Automation Conference on
Gwo-Dong Chen
, Daniel D. Gajski
|