The Informational Capacity of the Human Ear

Homer Jacobson1

Department of Chemistry,
Hunter College, New York City

Science 112:143-144 (August 4, 1950)



New concepts of the nature and measure of information (1, 4) have made it possible to specify quantitatively the informational capacity of the human ear. A published estimate (5) gives 330,000 as the approximate total number of monaurally distinguishable tones of all frequencies and intensities. Dividing this figure by see, the approximate average time necessary for the discriminations measured, 1.3 × 106 is arrived at as the total number of distinguishable tone choices the ear can make in a second. The same figure can be obtained by an independent calculation. An extrapolation of Gabor’s data (1) on the efficiency of perception of ‘‘logons,’’ or elementary signals, up to 16 kc gives an average of 18% of the total, or 5,800, as the number perceptible in 1 sec. Using the Riesz intensity discrimination data (3), a weighted average of 230 j.n.d.’s (just noticeable differences) of intensity for pure tones can be obtained, over the whole frequency range. If it is assumed that the number is the same for an individual logon, a total of 230 × 5,800 = 1.3 × 106 distinguishable tones/sec is calculated, in complete agreement with the figure estimated by the first method.

To express the capacity of the ear in the conventional informational units of ‘‘bits’’ (binary digits)/sec, it is necessary to inquire how many of the distinguishable tones are independent of each other. A crude procedure is to assume that neighboring logons can be independently perceived. The total number of bits/sec will then be the product of 5,800, the number of logons/sec, by the average number of bits/logon. The latter figure is calculated from the Riesz data (3) to be 8.2, by taking the weighted average of the log, of the number of intensity j.n.d.’s at each frequency. By this procedure, about 50,000 bits/sec is the estimated informational capacity of the ear.

Since neighboring frequencies are known to mask one another, this figure is certainly high. Wever’s recent critical review (7) presents convincing evidence that the masking is due both to peripheral and central interference phenomena. However, calculation of the effect of masking will be performed under the arbitrary assumption that it is due strictly to stimulation of a broad region on the basilar membrane, in accordance with a "place" theory of hearing. The calculated informational capacity depends only on the masking and intensity data, and is independent of which auditory mechanism is used.

On this basis, response of the ear at any time can be described by the ‘‘stimulation profile,’’ a graph of stimulus intensity vs. position on the basilar membrane. It represents a sort of short period Fourier analysis of the sound along the membrane. The effect of the phenomenon which leads to masking of adjacent frequencies can be represented by stating that the derivative of the profile cannot exceed a certain value. Thus intense sounds ‘‘swallow up’’ faint ones at neighboring frequencies, because stimulus intensity decays insufficiently with frequency to allow the weak tone to be perceived.

At any point in the hearing region, a certain length along the basilar membrane will have to be traversed before a single intensity j.n.d. can be perceived. This can he called the masking distance, and is larger for low tones than for high ones, in. general. With simplifying assumptions, the total number of distinguishable configurations possible to the stimulus profile can be counted. Using the Riesz intensity perception data (3), and averaging with the Wegel and Lane masking data (6), a length of about 0.04 mm on the basilar membrane is obtained as the average masking distance.

Any of three relations may be observed between two points separated by this distance on the profile. The point at higher frequency may be (1) the same intensity, (2) one j.n.d. more intense, or (3) one j.n.d. less intense than the lower frequency point. A simple calculation gives approximately 230 × 3(32/.04) as the total possible number of profiles, since there are 230 j.n.d.’s in the average intensity range, three choices of profile direction in one masking distance, and 32/.04 or 800 masking distances along the total basilar membrane. Hence, the total number of bits per profile is log2 (230 × 3800), or about 1,300 bits. Dividing by 1/6 sec, the time required for the intensity discrimination measurements, a figure of about 8,000 bits/sec is obtained for the informational capacity of one human ear.

Two assumptions cause this figure to be crude. First, it is likely that the use of intensity measurements of shorter duration than 1/6 sec would increase this figure, by allowing more profiles per sec to be possible, even though the amount of information per profile would decrease somewhat. Second, the use of a single averaged masking distance is subject to some error. Phase information, generally of no value except to binaural phenomena, is also neglected.

An important observation is that the dynamic range of the ear scarcely affects the final calculations. Only when the intensity range of the intelligence is cut down to 10 or 20 dB is a sufficient fraction of the possible profiles ruled out to lower appreciably time informational capacity.

Since about 29,000 ganglion cells (2) are to be found in a cochlear nerve, we observe about 0.3 bit/sec of information per nerve fiber. This low figure naturally results from the lack of coding in the cochlear innervation. If the capacity is taken as 1.3 × 106 distinguishable tones/sec, then about 40 tones/sec can be accounted for by each fiber. This is about 10% of the maximum number of impulses which an adapted nerve fiber can carry.

1. Present address: Department of Chemistry, Brooklyn College, Brooklyn, N. Y.

References

  1. Gabor, D., J. Inst. Elec. Eng., Lond., 1946, 93, 429 Pt. II.
  2. Guild, S., Acta Oto-Laryngol, Stockh., 1932, 17, 207.
  3. Reisz, R., Phys. Rev., 1928, 31, 867.
  4. Shannon, C. E., Bell. Syst. tech. J., 1948, 27, 379, 62
  5. Stevens, S. S., and Davis, H. Hearing, its psychology and physiology. New York: John Wiley, 1938.
  6. Wegel, R., and Lane, C., Phys. Rev., 1924, 23, 266.
  7. Wever, E. G., Theory of hearing. New York: John Wiley, 1949.

Created: February 14, 2000
Last Modified: January 8, 2001
HTML Editor: Robert J. Bradbury