Decision processes in frequency discrimination Wait Jesteadt*

and Sandra

L. Sims

Bioacoustics Laboratory, Eye and Ear Hospital and Schoolof Medicine, Universityof Pittsburgh, Pittsburgh, Pennsylvania15213 (Received 13 December 1974; revised 23 January 1975) The ability to discriminatea change in frequency was measuredin 2IFC, same-different, and yes-no paradigms using two tasks commom in the literature--discrimination of two pure tones differing only in frequency and detection of frequency modulation (FM). In the pure-tone task, differencesin the level of frequency-discriminationperformance as a function of paradigm were similar to those previously observedfor both frequency and intensity discrimination. The level of performance in 2IFC and same-different exceededthe level of performance in yes-no by more than the theory of signal detection would predict. For the FM detection data, differences in the level of performance between paradigms were less than the theory of signal detection would predict. These differences increased,however, when the same subjectswere tested in a task requiring discrimination of two FM tones. These results suggestthat a memory factor is operating in discrimination tasks to a greater degree than in detection tasks. The memory factor is interpreted in terms of criterion variance. Subject Classification: 65.54, 65.35, 65.75.

INTRODUCTION

that the observer

Early theories about the auditory system were primarily concerned with accounting for its frequency-anal-

ysis ability (Ohm, 1843; Helmholtz, 1863). The ability to discriminate small differences in frequency is one obvious measure of the general ability. Although our understanding of the physiological processes underlying frequency resolution has greatly improved in recent

applies the principles

of statistical

decision theory (Wald, 1950)to distributions of sensory events, that the decision axis is monotonic with intensity, and that memory effects are minimal.

Jesteadt and Bilger (1974) recently have reviewed the predictions of the TSD model concerning relative per-

formance in two-interval forced choice (2IFC), samedifferent (SD), and yes-no (YN) paradigms. In the the-

years, our empirical methods for assessing how well

ory of signal detection, the level of performance is ex-

subjects can discriminate small differences in frequency have not. The variability in the reported data is far greater than that in the corresponding intensity-discrimination data. This greater variability can be traced to

pressed in terms

at least two sources: vidual differences

(1) the existence of greater indi-

in frequency-discrimination

ability

than in intensity-discrimination ability (e.g., Jesteadt and Bilger, 1974, Fig. 1), and (2)the lack of consensus concerning the proper procedures to be used in frequency-discrimination tasks as compared to intensity-discrimination tasks. Our primary concern here is with the second source of variability. One aspect of the methodological problem is the continued use of a wide range of psychophysical procedures and data analyses that result in unrelated

measures

of discrimination.

Another

is the continued use of two very different stimulus configurations--pure tones differing in frequency and pure tones that may or may not be frequency modulated.

In recent studies of intensity discrimination, widespread application of the theory of signal detection (TSD)

(Green and Swets, 1966) has led to the use of a limited range of forced-choice paradigms and uniform data analyses. This uniformity reflects the basic assumption that a similar decision process is involved in all psychophysical procedures. The details of the data analyses reflect specific assumptions cuncerning the nature of the decision process. These specific assumptions are testable, in part, because they lead to precise predictions concerning relative performance in the various paradigms. The traditional TSD model of the decision

process (Swets, Tanner, and Birdsall, 1961)assumes 1161

J. Acoust. Soc. Am., Vol. 57, No. 5, May 1975

of the decision

of the difference

distributions

relative

between the means to the standard

deviation of those distributions. The standard prediction is that performance in 2IFC will be higher than perfor-

mancein YN by a factor of V"•. Predictionsconcerning the SD task are less specific because of the ambiguous role played by the standard tone presented in the first interval. The traditional TSD analysis predicts that performance in SD should be poorer than performance in 2IFC by a factor of 2 and poorer than performance in

YN by a factor of V"•. Sorkin(1962)has argued, however, that if the same standard is used on every trial, performance in SD should never be lower than performance

in YN.

For detection, performance has generally been found to differ less across paradigms than TSD would predict,

with the 2IFC/YN performance ratio less than v•- (Swets and Green, 1961; Schulman and Mitchell, 1966; Markowitz and Swets, 1967; Leshowitz, 1969). Subjects apparently do not make perfect use of the additional information present in 2IFC. For intensity discrimination, the differences in performance are greater than TSD

would predict.

The 2IFC/YN performance ratio is

closer to 2 thanto •

(Leshowitz,1969; Viemeister,

1970; Pynn, Braida and Durlach, 1972; Jesteadt and Bilger, 1974). These deviations from the prediction have led some authors to suggest that memory effects play a greater role in discrimination than originally as-

sumed(Viemeister, 1970; Long, 1973; Jesteadt and Bilger, 1974). Why memory should play a greater role in intensity

discrimination

than in detection is not clear.

Copyright ¸ 1975 by the Acoustical Society of America

1161

Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 128.59.171.71 On: Thu, 11 Dec 2014 06:58:06

1162

Jesteadtand Sims: Decisionprocesses in frequencydiscrimination

I. DECISION PROCESSESIN PURE-TONE MEASURES OF FREQUENCY

DISCRIMINATION

In the few cases where the nature of the decision process has been considered in the frequency-discrimination literature, two very different models have been suggested. One model was proposed by Pierce and Gilbert (1958) to account for differences in results obtained with

AB and ABX paradigms (Rosenblith and Stevens, 1953). It closely parallels the traditional that the decision

axis

is assumed

TSD analysis, except to be monotonic

with

frequency rather than intensity. This model predicts the relations among paradigms outlined above. The

other model, proposedby Wickelgren (1969), deals only with the SD paradigm and assumes that judgments of

"sameness" are based on the familiarity of the comparison tone.

The decision

axis is monotonic

with familiar-

ity, not frequency. The subject could not use such a system to make high-low judgments and Wickelgren argues that these judgments are mediated by a separate process. Wickelgren's model, therefore, predicts that performance in 2IFC is unrelated to performance in SD. Such a weak prediction would be of little interest were it not for Wickelgren's prediction that performance in SD is generally better than performance in 2IFC in frequency-discrimination tasks. Since the superiority of performance in 2IFC in intensity discrimination has been well established, Wickelgren is, in fact, asserting that there are basic differences in the processing of intensity and frequency information. A number of other authors have made similar assertions (e.g., Stevens, 1957; Tanner and Rivette, 1964; Whipple, 1901). Because the models proposed by Pierce and Gilbert (1958) and by Wickelgren (1969) make differential predictions concerning relative performance in different paradigms, the most straightforward test of the models is to compare performance in these paradigms. Jesteadt

and Bilger (1974) found the relations among 2IFC, SD, and YN paradigms in frequency discrimination to be very similar to those in intensity discrimination and suggested that the same decision process was operating in both cases. The fact that performance in 2IFC greatly exceeded performance in SD and YN led them to re-

ject Wickelgren's model. As in other paradigm comparison studies involving discrimination rather than detection, however, they found it necessary to assume the existence of a memory factor to account for the large deviations from the TSD predictions. Thus, neither model of the decision process was supported unequivocally. Variability in the performance ratios for individual subjects and relatively poor overall performance in the frequency-discrimination tasks increased the ambiguity. One purpose of the present study was to replicate the fixed-standard, frequency-discrimination portion of the earlier

study.

1162

Shower and Biddulph than to Riesz. These references often iuvolve a comparison of new pure-tone data to the earlier FM data, with the implicit assumption that the two stimulus configurations measure the same properties

of the auditory system (e.g., Henning, 1966). We are aware of no direct evidence to support this assumption, and there is, in fact, some evidence to the contrary. In general, studies using pure-tone stimuli have reported smaller DL's than those reported by Shower and Biddulph,

with especially large differences below 500 Hz (Stevens, 1954). Nordmark (1968) has argued that the data reported by Shower and Biddulph are not representative

and that data from more recent FM studies (Meurman, 1954; Groen and Versteegh, 1957; Filling, 1958), when plotted correctly, show DL's as a function of frequency similar to those obtained in studies using pure tones.

The pattern described by Stevens (1954) can still be observed, however, in the replotted data. There are no data in the literature concerning decision processes in FM measures of frequency discrimination. Knowledge of these processes should increase our understanding of the data collected with this stimulus configuration and its relation to pure-tone data. FM conditions were therefore included in the present study to permit a direct comparison of the decision processes in FM and pure-tone measures of discrimination. Because the difference between these measures is greatest at low frequencies, discrimination was tested at 250 Hz as well as at 1000 Hz in all paradigms. III.

A.

EXPERIMENT

I

Method

1. Sub/ects Three members of the laboratory staff served as subjects. They all had prior experience as listeners in frequency-discrimination tasks. 2.

Conditions

Discrimination was tested at 250 and 1000 Hz using pure-tone and FM signals in each of three psychophysical paradigms. All signals were presented at 60 dB SPL. In the pure-tone-signal conditions, the subjects' task was to discriminate between two possible pure tones differing only in frequency. One of the two possible tones was the 250-

or 1000-Hz

standard

and the other

was

higher in frequency by some amount. In the FM-signal conditions, the subjects' task was to discriminate between a 250- or 1000-Hz pure-tone and an FM signal with the same center frequency and an 8-Hz modulation rate.

of both frequency discrimination (Shower and Biddulph, 1931) and intensity discrimination (Riesz, 1928), there

In each of the four conditions (pure-tone and FM signals at 250 or 1000 Hz), discrimination was tested at three degrees of difficulty to generate psychometric functions. In the pure-tone-signal conditions, different degrees of difficulty were obtained by varying the difference in frequency between the standard and the other possible tone. In the FM signal conditions, they were obtained

are many more references

by varying the range of the frequency modulation.

II.

DECISION

FREQUENCY While

PROCESSES

IN FM MEASURES

OF

DISCRIMINATION

modulated

tones

were

used in the classic

in the current

literature

studies

to

J. Acoust. Soc. Am., Vol. 57, No. 5, May 1975

Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 128.59.171.71 On: Thu, 11 Dec 2014 06:58:06

1163

JesteadtandSims: Decisionprocesses in frequencydiscrimination

1163

3. Paradigms

In the 2IFC paradigm, the subjects were instructed to listen to two signals and decide in which interval the higher of the two possible frequencies or the FM signal occurred. In the SD paradigm, the subjects were in-

structed to listen to two signals and vote "Different" if the second signal was the higher frequency or was modulated. In the YN paradigm, the subjects were instruct-

ed to listen to one signal and vote "Yes" if it was the higher frequency or was modulated. 4. Apparatus The signals for the pure-tone

conditions were gener-

ated by a programmable oscillator (Krohn-Hite 453-2). The signals for the FM conditions were generated by a

Wavetek oscillator (model 116). A second Wavetek oscillator was used to generate the 8-Hz sinusoidal modu-

lating waveform. A programmable attenuator (Wolf, 1972) controlled the amplitude of the modulating wave-

form and, therefore, the rangeof the modulation. • The

I

2

3

I

AF

pure-tone and FM signals were gated by an electronic

IN

2

3

4

5

6

Hz

switch (Grason Stadler model 829D)with a 25-msec

FIG. 1.

rise-decay

digrns--2IFC (squares), same-different (diamonds), and yesno (circles). The six panels contain data for 250 Hz (on the left) and 1000 Hz for the three subjects, SLS, LJF, and CMR.

time.

These devices were controlled by a

PDP-15/20 computer that was also used to record subjects' responses. The softwave and interface have been

Pure-tone frequency discrimination

in three para-

described by Jesteadt (1972). The signals were presented monaurally through TDH-49 phones to the three subjects in separate sound-treated

with the assumptions of the TSD model and Wickelgren's

rooms.

(1969) model of the decision process. We refrain from 5.

Procedure

labeling this measure of performance d • because that

A 2IFC or SD trial consisted of a warning interval, two 500-msec observation intervals separated by 500 msec, an answer interval, and a correct-answer feedback interval. The same pattern was used for YN trials, with the exception of the intersignal interval and second observation

In each daily 2-h session, 100 trials were run at each

preceded by six days of practice during which the appropriate pure-tone frequency differences ranges were determined.

and FM

A preliminary analysis of the data indicated that discrimination performance in the FM conditions was much lower than that in the pure-tone conditions. The FM conditions were therefore repeated using larger FM ranges, so that the level of difficulty of the pure-tone and FM conditions would be more nearly equal. The largest of the three FM ranges in the original data collection was used as the smallest of the three FM ranges in the replication.

tion.

We refer

to the uncorre.cted

discrimination

mea-

sures for different paradigms as d•.zFc, dsD, or these measures for different paradigms, we use the term

The values of dp for the PT and FM conditions are presented in Figs. 1 and 2, respectively. Each panel contains performance data for an individual subject for

the three paradigms. For the FM conditions, the d• for the middle FM range represents the mean of the d• values for the original data collection and the replication. Wherever performance by a given subject for the larger FM ranges was too high to measure, data for those FM ranges were discarded for all three paradigms. The best fitting linear psychometric function has been drawn through each set of points representing performance in a given paradigm, with the restriction that all three

functions (for each subject) pass through a commonx intercept. Our reasons for imposing this restriction are the same as those cited by Jesteadt and Bilger (1974) for fitting two-point functions with straight lines through the origin.

It is the minimum

restriction

that allows us

to compare performance in different paradigms for a given subject and condition in terms of one parameter

Results

d•/Hz, and in a few cases it reducesthe effect of what

For each level of difficulty, the proportions correct for each of the two possible responses were converted to z scores.

pensate for the difference in performance between paradigms predicted by TSD. We have made no such correc-

Where a general label is needed for more than one of

interval.

of three points on the psychometric function for each of the three paradigms for a given condition (pure-tone or FM signal at 250 or 1000 Hz). A total of 200 trials for each point on each function was obtained by running each of the four conditions on two different days in a counterbalanced order. The eight days of data collection were

B.

term implies that a V• correction has been made to com-

The two z scores

a measure of discrimination

were

then

performance

added to obtain

appears to be poor estimates of performance. The more severe restriction that the functions pass through the origin obviously could not be applied in the FM condi-

consistent

tions.

J. Acoust. Soc. Am., Vol. 57, No. 5, May 1975

Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 128.59.171.71 On: Thu, 11 Dec 2014 06:58:06

1164

Jesteadtand Sims: Decisionprocesses in frequencydiscrimination

1164

TABLE II. Ratios of the slopes of psychometric functions describing relative frequency-discrimination performance in the different paradigms for each condition in Experiment 1.

4 I I I I IID// I I0It 3 ./•

./

2

Slope ratio

//

Condition

Subject

2IFC/YN

2IFC/SD

SD/YN

PT-250

1

1.98

1.57

1.26

2

2.40

1.72

1.40

3

2.41

1.51

1.59

i .

i

I

i

I

i

i

i

I

_

_

_

_

_

_

PT-1000

_

_

Mean

2.26

1.60

1.42

1

2.57

2.63

0.98

2

2.62

1.42

1.85

3

2.69

'

2.39

1.12

Mean

2.65

2.16

1.32

1

1.47

1.54

0.95

2

0.94

1.00

0.94

3

1.56

1.39

1.12

Mean

1.32

1.31

1.00

_

_

FM-250 i

i

i

i

i

i

i

i

/

2 •,•i'""'

FM-1000

1'

0.96

0.81

1.19

2

1.08

1.08

1.00

3

1.28

1.11

1.16

Mean

1.11

1.00

1.12

_

i ?

4.

i 8

6

FM

RANGE

IN Hz

FIG. 2. FM detection in three paradigrns--2IFC (squares), same-different (diamonds), and yes-no (circles). The six panels contain data for 250 Hz (on the left) and 1000 Hz for the

described in terms of the ratio of the slopes of the psychometric functions. The slope ratio provides an esti-

mate of the ratio of the values of d• that would be ob-

three subjects.

tained

We briefly consider overall differences between conditions before dealing with relations between paradigms. It is obvious from a comparison of Figs. 1 and 2 that performance was generally poorer in the FM conditions than in the pure-tone conditions. While the slopes of the psychometric functions for the FM conditions are generally not as steep as those for the pure-tone conditions, the major difference is that the x-axis intercepts for the FM conditions are markedly greater than zero. Implications of this difference are discussed below.

if the same

increment

size

were

used in both

paradigms, and has the advantage of being based on performance for several increment sizes. Slope ratios for the three possible pairwise comparisons of paradigms are presented in Table II. Only two of these, of course, contain independent information. The third is provided for convenience.

The ratios for the pure-tone conditions in Table II agree with the earlier finding by Jesteadt and Bilger

(1974) that discrimination perœormanceis consistently better in the 2IFC paradigm than in the other two paradigms by factors greater than those predicted by the

To facilitate comparisons across conditions and comparisons with other studies, we used the functions in

traditional TSD analysis, a difference in the opposite direction from that predicted by Wickelgren (1969).

Figs. 1 and 2 to estimate the values of Af/f required for 75% correct performance in 2IFC. These values are presented in Table I. The difference in zXf/f be-

The FM performance ratios are markedly lower than those for the pure-tone conditions. In three out of six

tween the FM and pure-tone conditions is clearly larger at 250 than at 1000 Hz.

The data are therefore

in better

agreement with the. summary presented by Stevens

(1954) than with the summary presented by Nordmark (•968). The relations between any two paradigms can best be

cases, the 2IFC/YN ratio obtained in the FM conditions is as large or larger than 2IFC/YN ratios obtained for detection of pure tones in noise (see Jesteadt and Bilger, 1974, Table I). In the other three cases, performance appears to be uniform across paradigms.

Superficially, at least, the performance ratios for the pure-tone data resemble those for intensity discrimination, while the ratios for the FM data resemble those for detection.

TABLE I.

1

formation but rather to a general difference between dis-

2

3

4

PT-250

0. 0037

0. 0034

0. 0030

0. 0034

0. 0019

0. 0021

0. 0020

0. 0020

FM-250

0. 0166

0. 0245

0. 0121

0. 0177

0. 0098

as

Subject

PT-1000

FM-1000

can be described

1.

Values of Af/f required for 75% correct in 2IFC

for each condition in Experiment

Condition

Since the FM task

detection of modulation, it may be that the difference in performance ratios we obtained is not related specifically to a difference in processing pure-tone and FM in-

0. 0064

0. 0076

0. 0079

crimination and detection tasks. To test this hypothesis, we conducted a second experiment to measure performance in the 2IFC, SD, and YN paradigms in an FM discrimination

task in which the standard

as well

as the

comparison signal was modulated and subjects were asked to discriminate between the two modulation ranges.

J. Acoust. Soc. Am., Vol. 57, No. 5, May 1975

Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 128.59.171.71 On: Thu, 11 Dec 2014 06:58:06

1165 IV. A.

Jesteadtand Sims: Decisionprocesses in frequencydiscrimination EXPERIMENT

2

1165

TABLE III. Ratios of the slopes of psychometric functions describing relative frequency-discrimination performance in the different paradigms in Experiment 2.

Method

The subjects used in Experiment 1 served as subjects in this experiment. The standard was an FM signal øø Hz range centered on • 000 Hz. T h• modulated over a o•rate of modulation was 8 Hz. The comparison signals

Slope ratio

Subject

2IFC/YN

1

1.67

1.73

0.96

2

2.00

1.54

1.30

3

1.38

1.29

1.07

Mean

1.68

1.52

1.11

were modulated at the same rate over larger ranges and four different comparison ranges were used to obtain four-point psychometric functions for each paradigm. Because individual differences in this task were greater than those encountered in Experiment 1, we used different comparison ranges for each observer. In all other details, Experiment 2 was identical to the FM portion of Experiment 1.

V.

DISCUSSION

B.

A.

Overall performance

Results

The data analysis was also identical to that in Experi-

ment 1. Values of dpare presentedin Fig. 3. The values of Af on the abcissa indicate the differences between the comparison ranges of modulation and the 32-Hz standard range. Table III contains the ratios of the slopes of functions fitted to the data in Fig. 3. The new psychometric

functions have intercepts

closer

to zero and shallower slopes, especially in the SD and YN paradigms.

A comparison of Tables II and III in-

dicates that the 2IFC/SD and the 2IFC/YN ratios are larger for FM discrimination

than for FM detection.

The SD/YN performance ratio showslittle change. While the new performance ratios are not as large as those for pure-tone discrimination, they are larger than both the TSD predictions and the ratios typically obtained

2IFC/SD

SD/YN

for detection of pure tones in noise.

There are several studies in which pure-tone frequency discrimination has been measured using a 2IFC paradigm with signal durations and levels comparable

those in the current study (Harris, 1952; Henning, 1970; Moore, 1973). The variability in the data obtained with 2IFC is much smaller than the variability observed in the frequency-discrimination literature as a whole. These studies, therefore, provide a good benchmark against which to compare new data. The mean Af/f at 250 Hz for 2IFC in the present study (Table I) is 0. 0034. The mean values across subjects for the ear-

lier studies are 0.0031, 0. 0027, and 0. 0028, respec-

tively. The mean Af/f at 1000Hz in Table II is 0. 0020, while the corresponding means for the earlier are 0. 0013, 0.0014, and 0.0018. The quality of the FM data is more difficult

studies to evaluate.

We are not aware of any FM studies that have used procedures comparable to those in the present study. The

values of Af/f for the FM conditions in Table I are larger than those in the literature, but all of the previously published values were obtained with the method

of adjustment.

We know that pure-tone measures of

Af/f obtainedwiththe methodof adjustment(Nordmark, 1968; Rakowski, 1971) are generally smaller than those obtainedwith 2IFC. The difference in procedure, therefore, may accountfor the larger values of Af/f we observed. •' The relation between discrimination at 250 and 1000 Hz for the FM conditions

in Table I is similar

to that observed in previous FM studies. The relatively poor performance in detecting frequency modulation at low center frequencies is apparently not a function of procedure.

There is general agreement across all of the studies in the literature that pure-tone procedures result in

smaller values of Af/f than FM procedures.

The data

in the present study indicate that an important factor contributing to this difference is the nonzero intercept of the psychometric functions for FM detection. No matter what scales were used in plotting the FM data for Experiment 1, at least two of our three subjects would show chance performance for the smallest FM range.

FIG. 3. FM discrimination in three paradigms--2IFC

(squares), same-different (diamonds),and yes-no (circles). The three panels containdata for the three subjects.

The large difference in intercepts for the pure-tone and FM psychometric functions may give an indication

J. Acoust. Soc. Am., Vol. 57, No. 5, May 1975

Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 128.59.171.71 On: Thu, 11 Dec 2014 06:58:06

1166

of how subjects are performing the FM task. If the subjects perform both the pure-tone and the FM tasks by timing the periods of individual cycles, as Nordmark (1968) has suggested, then we would expect to see psychometric

functions

in both

tasks

with

a common

inter-

cept near zero and a difference only in slope. The fact that this is not the case may indicate that the subjects are performing a spectral analysis of the FM signal,

as Kock (1937) and Feth, Wolf, and Bilger (1969) have suggested.

B. Relations between paradigms The primary goals of the present study were to replicate one condition of the study reported by Jesteadt

and Bilger (1974) and to extend the range of conditions to include low-frequency signals and FM. The performance ratios for the replicated condition, pure-tone

frequency discrimination at 1000 Hz, are in agreement with

1166

Jesteadt and Sims: Decision processesin frequency discrimination

the earlier

data that

indicate

that

the relations

be-

tween paradigms are similar to those for intensity discrimination. The pure-tone data for 250 Hz show the same relations as those for 1000 Hz, within the limits of the variability always observed in performance ratios. 3

The pure-tone data do not support Wickelgren's (1969) model of pitch memory and decision processes in the same-different paradigm. The upper and lower limits of performance in SD can always be specified in terms

of performance in 2IFC or YN.

There are, therefore,

no effects unique to $D that require an independent model for that paradigm. The data do support Wickelgren's assertion that memory factors play an important role in frequency-discrimination tasks. The most straightforward way of accounting for the pure-tone data is to assume that the TSD model of the decision process is correct, but that memory for the standard is far from perfect. If the memory factor is considered as an additional source of decision variance and some assumptions are made concerning its magnitude relative to the variance in a single observation, specific predictions can again be made concerning relations between paradigms. Vie-

meister (1970) has discussed the special case where memory variance

and observation variance

are equal.

In this case, the 2IFC/YN performance ratio shouldbe 2.0, while the SD/YN ratio shouldremain 1.0. In the SD paradigm, the decision variance is the same whether the observer

listens

to the first

interval

or relies

on

-

memory for the standard as he would in YN. In general, as memory variance increases from zero to some amount

equal to the observationvariance, the 2IFC/YN and 2IFC/SD ratios shouldincreasefrom the factor of • predicted by TSD to the factor of 2.0 predicted by Viemeister (1970). As memory variance increases beyond

observationvariance, the 2IFC/YN and SD/YN ratios shouldincrease indefinitely, while the 2IFC/SD ratio remains constant at 2.0.

For memory variance in this

range, the subject should listen to the standard in SD, rather than relying on memory. The subject should then be performing the same operations in SD and 2IFC, and the relation between these two paradigms should be governed by the use of the order information in 2IFC,

not by memory for the standard.

When the SD/YN ratio is significantly greater than 1.0, it can be used to estimate the magnitude of the memory variance

in YN relative

to the sensation variance

in a singleobservation.4 Usingthe meanSD/YN ratio for both pure-tone conditions, 1.37, we estimate that the memory variance exceeds the observation variance in YN by a factor of 2.75. If we were to assume that memory variance plays some role in SD as well as YN, our estimate of the relative magnitude of the memory variance would increase. The pure-tone data are therefore consistent with the assumption that memory variance not only exists, but is greater than the sensory variance in a single observation. A possible interpretation of the memory factor is discussed below. The relations between paradigms observed in the FM conditions indicate still another way in which standard pure-tone and FM measures differ from one another. While the pure-tone conditions showed paradigm effects much larger than predicted on the basis of TSD, the FM conditions showed effects that were smaller than predicted

and in some

cases

nonexistent.

As we noted

above, paradigm comparison studies involving detection of pure tones also have reported effects smaller than the TSD predictions.

The range of paradigm effects observed in the two experiments indicates that there are no fixed values for the ratios describing relative performance in different paradigms. The results suggest that at least two factors govern the size of the performance ratios. One of these is memory variance. For the most part, the pattern of performance ratios in the FM detection, FM discrimination, and pure-tone discrimination conditions can be accounted for by assuming that these tasks fall at different points on a memory-variance continuum. The small paradigm effects observed in the FM detection conditions suggest that memory for the standard is essentially perfect. In the FM discrimination condition, memory variance appears to be present, but is less than the observation variance. In the pure-tone discrimination conditions, as we noted above, memory variance appears to be greater than observation variance. The memory-variance assumption cannot account for the fact that in the FM detection task, and in the detec-

tion of pure-tones in noise, the 2IFC/YN and 2IFC/SD performance ratios are less than TSD would predict. The prediction that performance is better in 2IFC than in SD or YN reflects the assumption that the decision is based

on the difference

in the first

between

and second intervals.

the observations

In 2IFC

made

this difference

can be either positive or negative, while in SD or YN (where memory for the standard replaces one of the intervals) it can only be positive or zero. The fact that the 2IFC/YN or 2IFC/SD ratios are less than the pre-

dictedV•- in detectiontasks suggeststhat subjectscannot make the necessary subtraction consistently.

In dis-

crimination tasks where the 2IFC/YN and 2IFC/SD ra-

tios are greater thanthe predictedVr•-, it is difficultto determine

the relative

contributions

of order

information

and memory variance to the ratios. When the SD/YN ratio is greater than 1, however, it is possible to sort

J. Acoust. Soc. Am., Vol. 57, No. 5, May 1975

Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 128.59.171.71 On: Thu, 11 Dec 2014 06:58:06

1167

Jesteadtand Sims: Decisionprocesses in frequencydiscrimination

out the two factors. In this case, the subject is assumed to be listening to both intervals in SD and 2IFC with the same memory effects in both paradigms. If the subject is making full use of the order information in 2IFC, the

2IFC/SD ratio shouldbe 2. 0. The mean 2IFC/SD ratio for both pure-tone conditions in Experiment i is 1.88. This suggests that the subjects are making better use of the order information in 2IFC in the pure-tone discrimination

task

than

in the

FM

detection

task.

C. The memory factor The data presented here suggest that there is greater memory variance in discrimination tasks than in detection tasks. The interpretation of this factor that seems most compatible with the other TSD assumptions is to consider assume

it as criterion that

the

variance.

criterion

It is reasonable

is more

stable

when

to

the two

signals produce sensations that differ qualitatively as well as quantitatively. A stable criterion based on a qualitative difference is, of course, a characteristic of

categorical perception (Studdert-Kennedy et al., Our interpretation

of the difference

and discrimination

tasks

has much

recent analysis of categorical

sounds(Miller et al.,

1970).

between detection in common

with

a

perception of nonspeech

1973). It shouldbe noted that

their analysis included detection of pure tones in noise as an example. VI.

CONCLUSIONS

The experiments

reported here indicate the following.

(1) The results obtained with pure-tone and FMsignals differ in many ways, even if the signals are presented in identical paradigms. The pure-tone and FM data differ in terms of overall performance, performance as a function of frequency, performance as a function of paradigm, and the nature of the psychometric function.

(2) The variability across signal conditions and paradigms is large with respect to the variability across subjects. The data for pure-tone discrimination in the

2IFC paradigm can be added to a growing list of highly consistent

2IFC

data collected

frequency-discrimination

in a number

of different

studies.

(3) The differences in frequency discrimination as a function of paradigm are in the direction predicted by

TSD, not in the direction predicted by Wickelgren (1969). (4) The effect of paradigm is larger than TSD would predict for both pure-tone and FM discrimination, but smaller than TSD would predict for FM detection. This suggests that memory variance is entering into the decision process in some paradigms in the discrimination task, but not in the detection task.

1167

ACKNOWLEDGMENTS

This research was supported by grants from the Na-

tional Institutes of Health, Public Health Service, U.S. Department of Health, Education and Welfare, and the National

Science

Foundation.

The authors

wish to ex-

press their appreciation to Robert C. Bilger, David M. Green, and Craig C. Wier for many helpful comments on earlier

versions of this paper.

*Now at the Department of Psychology and Social Relations, Harvard University, Cambridge, MA 02138.

lWe use rangeof modulation here to refer to the difference between the highest and lowest excursions of the carrier frequency. This usage is consistent with that of previous FM studies. The procedures for calibrating range of modulation were similar to those described by Feth, Wolf, and

Bilger (1969).

2Thefact thatwe usedan 8-Hz modulation rate, rather than the 2-Hz rate used by Shower and Biddulph (1931) and Filling (1958), may also accountfor the larger values of Af/f we obtained. The calibration procedure we followed, however, should have eliminated the potential modulation-rate effect (Feth, Wolf, and Bilger, 1969).

3Wehavenotfoundanydiscussionin the literature of the fact that this variability is inherent in ratios of estimates of performance. If the error associated with each performance estimate is normally distributed, then the error associated with the ratio estimate will have a Cauchy distribution (see Cramer, 1946, p. 246). This distribution has infinite variance and, as a result, the law of large numbers does not hold. Mean ratios across subjects or conditions should therefore show the same variability as individual ratios. In reality,

the use of small numbers of trials

tend to limit

the size of the errors

associated

and other factors with

individual

performance estimates, so the variance of ratio estimates is less than infinite and there is some small advantage in considering mean ratios. The inherent variability cannot be avoided because there is no way of comparing paradigms that does not implicitly or explicitly involve ratios.

4Thecalculationof the memoryvariancerelative to the sensation variance is as follows. Assume that both dsD and dyN are given by the difference in mean sensations divided by the square root of the sum of the variance terms.

In the

YN case, there are two variance terms, one representing the sensation variance in a single observation and the other representing memory variance. In the SD case, the memory variance is assumed to be negligible, but the observation variance is doubled. Therefore,

d,SD_ dyN ,

•/O+M ,/2 ß 0

where 0 is the variance of a single observation and M is the memory variance. It follows directly that

The ratio of dsD to dy• is equivalent to the ratio of the slopes of the psychometric functions.

(5) A similar pattern of paradigm effects is observed in the literature on pure-tone detection and intensity discrimination. This suggests that the additional source of variance is not related to a specific signal ensemble but is central in origin. The approach most compatible with TSD is to interpret the memory variance as criterion

variance

stable

and to assume

in detection

tasks

that

the criterion

than discrimination

is more

tasks.

Cramer, H. (1946). Mathematical Methods of Statistics (Princeton U. P., Princeton, NJ). Feth, L. L. Wolf, R. V., and Bilger, R. C. (1969). "Frequency Modulation and the Difference Limen for Frequency," J. Acoust.

Soc. Am. 45, 1430-1437.

Filling, S. (1958). Difference Limen for Frequency (Andelsbogtrykkeriet-Copenhagen, I Odense). Green, D. M., and Swets, J.A. (1966). Signal Detection The-

J. Acoust.Soc. Am., Vol. 57, No. 5, May 1975 Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 128.59.171.71 On: Thu, 11 Dec 2014 06:58:06

1168

Jesteadtand Sims: Decision processesin frequency discrimination

ory Psychophysics (Wiley, New York). Groen, J. J., and Versteegh, R. M. (1957). "Frequency Modulation and the Human Ear,"

Acta Otolaryngol.

47, 421-430.

Harris, J. D. (1952). "Pitch Discrimination," J. Aeoust. Soe. Am. 24, 750-755.

Von Helmholtz, H. (1863). On the Sensations of Tone as a Physiological Basis for the Theory of Music (Dover, New

York), first Americaned.--•954. Henning, G.B.

(1966). "Frequency Discrimination of Random

Amplitude Tones," J. Acoust. Soc. Am. 39, 336-339.

Henning, G.B.

(1970). "A Comparison of the Effects of Signal

Duration on Frequency and Amplitude Discrimination," Frequency Analysis

and Periodicity

in

Detection in Hearing,

R.

Plomp and G. F. Smoorenburg, Eds. (Sijthoff, Lieden). Jesteadt, W. (•972). "SNP: A System for NonadaptiveProcedures,"

Behar.

Res. Meth. Instru.

4, 209-213.

Jesteadt, W., and Bilger, R. C. (1969). "Frequency Discrimination Near Masked Threshold,"

Percept.

Psychophys. 6,

405-408.

Jesteadt, W., and Bilger, R. C. (1974). "Intensity and Frequency Discrimination in One- and Two-Interval J. Acoust. Soc. Am. 55, 1266-1276.

Paradigms,"

Kock, W. E. (1937). "A New Interpretation of the Results of Experiments on the Differential Pitch Sensitivity of the Ear," J. Acoust. Soc. Am. 9, 129-134.

Leshowitz, B. (1969). "Comparison of ROC Curves from Oneand Two-Interval Rating-Scale Procedures," Am. 46, 399-402.

J. Acoust. Soc.

1168

der Vorrichtungen, "Ann. Phys. Chem. 59, 513-565. Pierce, J. R., and Gilbert, E. N. (1953). "On AX and ABX Limens, "J. Acoust. Soc. Am. 30, 593-595. Pynn, C. T., Braida, L. D., and Durlach, N. I. (1972). 'fintensity Perception. III. Resolution in Small-Range Identification, "J. Acoust. Soc. Am. 51, 559-566. Rakowski, A. (1971). "Pitch Discrimination at the Threshold of Hearing," Proc. Int. Cong. Acoust., Budapest, 373--376. Riesz, R. R. (1928). "Differential Intensity Sensitivity of the Ear for Pure Tones," Phys. Rev. 31, 868-875. Rosenblith, W. and Stevens, K. N. (1953). "On the DL for Frequency, "J. Acoust. Soc. Am. 25, 980-985. Schuman, A. I., and Mitchell, R. R. (1966). "Operating Characteristics from Yes-No and Forced-Choice Procedures," J. Acoust. Soc. Am. 40, 473--477. Shower, E.G., and Biddulph, R. (1931). "Differential Pitch Sensitivity of the Ear," J. Acoust. Soc. Am. 3, 275-287. Sorkin, R. D. (1962). "Extension of the Theory of Signal Detectability to Matching Procedures in Psychoacoustics, "J. Acoust. Soc. Am. 34, 1745-1751.

Stevens, S.S. (1954). "Pitch Discrimination, Mels, and Kock's Contention, "J. Acoust. Soc. Am. 26, 1075-1077. Stevens, S.S. (1957). "On the Psychophysical Law," Psychol. Rev. 64, 153-181. Studdert-Kennedy, M., Liberman, A. M., Harris, K., and Cooper, 'F. S. (1970). "The Motor Theory of Speech Percep-

tion: A Replyto Lane'sCriticalReview,"Psychol.Rev. 77, 234-249.

Long, G.R. (1973). "The Role of the Standardin an Auditory

the Slope of the Empirical ROC Curves: Comparisons of Binary and Rating Responses," Percep. Psychophys. 2, 91-

Swets, J. A., and Green, D. M. (1961). "Sequential Observations by Human Observers of Signals in Noise," in Information Theory, C. Cherry, Ed. (Butterworths, London). Swets, J. A., Tanner, W. P., Jr., and Birdsall, T. G. (1961). "Decision Processes in Perception," Psychol. Rev. 68, 301-340.

100.

Tanner, W. P., Jr., and Rivette, C. L. (1964). "Experi-

Amplitude Discrimination

Experiment,"

Percep. Psychophys.

13, 49-59.

Markowitz, J., and Swets, J.A.

(1967). "Factors Affecting

Meurman, O. H. (1954). "The Difference Limen of Frequency in Tests of Auditory Function," Acta Otolaryngol.

Suppl. 118,

144-155.

Miller,

J. D.,

Wier,

C. C.,

Pastore,

R. E.,

Kelly, W. J.,

and Dooling, R. J. (1974). "Discrimination and Labeling of Noise-Buzz Sequences with Varying Noise-Lead Times: An Example of Categorical Perception," J. Acoust. Soc. Am. 55, 390.

Moore, B. C. J. (1973). "Frequency DLs for Short Duration Tones," J. Acoust. Soc. Am. 54, 610--619. Nordmark, J. O. (1968). "Mechanisms of Frequency Discrimination," J. Acoust. Soc. Am. 44, 1533--1540. Ohm, G. S. (1843). "Uber die Definition des Tones, nebst daran geknupfter Theorie der Sirene und ahnlicker ton bilden-

mental Study of •Tone Deafness, •" J. Acoust. Soc. Am. 36, 1465--1467.

Viemeister, N. F. (1970). '•ntensity Discrimination: Performance in Three Paradigms," Percep. Psychophys. 8, 417419.

Wald, A. (1950). Statistical Decision Functions (Wiley, New York).

Whipple, G.M. (1901). "An Analytical Studyof the Memory Image and the Process of Judgment in the Discrimination of Clangs and Tones," Am. J. Psychol. 12, 409-457.

Wickelgren, W. A. (1969). "Associative Strength Theory of Recognition Memory for Pitch, "J. Math. Psychol. 6, 13-61. Wolf, R. V. (1972). "A Digitally Programmable Attenuator," Behav. Res. Meth.

Instrum.

4, 278.

J. Acoust.Soc. Am., Vol. 57, No. 5, May 1975

Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 128.59.171.71 On: Thu, 11 Dec 2014 06:58:06

Decision processes in frequency discrimination.

Decision processes in frequency discrimination Wait Jesteadt* and Sandra L. Sims Bioacoustics Laboratory, Eye and Ear Hospital and Schoolof Medicin...
1MB Sizes 0 Downloads 0 Views