Publications by authors named "Huanping Dai"

Musicians perform better than non-musicians on a variety of non-musical sound-perception tasks. Whether that musicians' advantage extends to spatial hearing is a topic of increasing interest. Here we investigated one facet of that topic by assessing musicians' and non-musicians' sensitivity to the two primary cues to sound-source location on the horizontal plane: interaural-level-differences (ILDs) and interaural-time-differences (ITDs).

View Article and Find Full Text PDF
Article Synopsis
  • Study explored how multiple randomly chosen sounds (maskers) affect the ability to detect a specific tone (1000 Hz) among them in normal-hearing adults.
  • Results showed significant improvements in detection sensitivity (26 dB) with increased signal duration, although individual responses varied widely.
  • Three signal-detection models were analyzed, revealing that while template-matching models could predict the observed shallow response patterns, they, along with other models, struggled to explain the steepness of the temporal integration noted in the study.
View Article and Find Full Text PDF

A previous modelling study reported that spectro-temporal cues perceptually relevant to humans provide enough information to accurately classify "natural soundscapes" recorded in four distinct temperate habitats of a biosphere reserve [Thoret, Varnet, Boubenec, Ferriere, Le Tourneau, Krause, and Lorenzi (2020). J. Acoust.

View Article and Find Full Text PDF

The two primary cues to sound-source location on the horizontal plane are interaural time differences (ITDs) and interaural level differences (ILDs). Here we asked whether the ability to discriminate small changes in each of these interaural cues differs between the sexes. We tested one group of males (n = 43) and females (n = 94) on ITD discrimination at 0.

View Article and Find Full Text PDF

Most sounds fluctuate in amplitude, but do listeners attend to the temporal structure of those fluctuations when trying to detect the mere presence of those sounds? This question was addressed by leading listeners to expect a faint sound with a fixed temporal structure (pulse train or steady-state tone) and total duration (300 ms) and measuring their ability to detect equally faint sounds of unexpected temporal structure (pulse train when expecting steady state) and/or total duration (<300 ms). Detection was poorer for sounds with unexpected than with expected total durations, replicating previous outcomes, but was uninfluenced by the temporal structure of the expected sound. The results disagree with computational predictions of the multiple-look model, which posits that listeners attend to both the total duration and temporal structure of the signal, but agree with predictions of the matched-window energy-detector model, which posits that listeners attend to the total duration but not the temporal structure of the signal.

View Article and Find Full Text PDF

As a practical guide, a formula is provided for averaging multiple estimates of decision weights, for the purpose of improving the accuracy and reliability of the final estimate of decision weights. The averaging over multiple weight estimates can take place either within or across listeners.

View Article and Find Full Text PDF

The neural basis of statistical learning as it occurs over time was explored with stimuli drawn from a natural language (Russian nouns). The input reflected the "rules" for marking categories of gendered nouns, without making participants explicitly aware of the nature of what they were to learn. Participants were scanned while listening to a series of gender-marked nouns during four sequential scans, and were tested for their learning immediately after each scan.

View Article and Find Full Text PDF

Proportion correct (Pc) is a fundamental measure of task performance in psychophysics. The maximum Pc score that can be achieved by an optimal (maximum-likelihood) observer in a given task is of both theoretical and practical importance, because it sets an upper limit on human performance. Within the framework of signal detection theory, analytical solutions for computing the maximum Pc score have been established for several common experimental paradigms under the assumption of Gaussian additive internal noise.

View Article and Find Full Text PDF

Monaural envelope correlation perception is the ability to discriminate between stimuli composed of two or more bands of noise based on envelope correlation. Sensitivity decreases as stimulus bandwidth is reduced below 100 Hz. The present study manipulated stimulus bandwidth (25-100 Hz) and duration (25-800 ms) to evaluate whether performance of highly trained listeners is limited by the number of inherent modulation periods in each presentation.

View Article and Find Full Text PDF

The optimal integration of information from independent Poisson sources (such as neurons) was analyzed in the context of a two-interval, forced-choice detection task. When the mean count of the Poisson distribution is above 1, the benefit of integration is closely approximated by the predictions based on the square-root law of the Gaussian model. When the mean count falls far below 1, however, the benefit of integration clearly exceeds the predictions based on the square-root law.

View Article and Find Full Text PDF

Psychophysical "reverse-correlation" methods allow researchers to gain insight into the perceptual representations and decision weighting strategies of individual subjects in perceptual tasks. Although these methods have gained momentum, until recently their development was limited to experiments involving only two response categories. Recently, two approaches for estimating decision weights in m-alternative experiments have been put forward.

View Article and Find Full Text PDF

A fundamental issue in the design and the interpretation of experimental studies of perception relates to the question of whether the participants in these experiments could perform the perceptual task assigned to them using another feature, or cue, than that intended by the experimenter. An approach frequently used by auditory- and visual-perception researchers to guard against this possibility involves applying random variations to the stimuli across presentations or trials so as to make the "unwanted" cue unreliable for the participants. However, the theoretical basis of this widespread practice is not well developed.

View Article and Find Full Text PDF

The form of the psychometric function (PF) for auditory frequency discrimination is of theoretical interest and practical importance. In this study, PFs for pure-tone frequency discrimination were measured for several standard frequencies (200-8000 Hz) and levels [35-85 dB sound pressure level (SPL)] in normal-hearing listeners. The proportion-correct data were fitted using a cumulative-Gaussian function of the sensitivity index, d', computed as a power transformation of the frequency difference, Δf.

View Article and Find Full Text PDF

The theoretical limits to the amount of error, or the Cramer-Rao bounds, were derived for estimating psychometric functions. These theoretical error bounds were compared with the variability of psychometric functions estimated from human as well as computer-simulated observers. For the simulated observers, due to the limited efficiency of the sampling strategies, including the placement of the signals and the distribution of the trials, the variances of the estimated parameters are seven times the theoretical bound for threshold and 22 times that for slope.

View Article and Find Full Text PDF

Perceptual weights were estimated in a pitch-comparison experiment to assess the relative influences of individual partial tones on listeners' pitch judgments. The stimuli were harmonic sounds (F0=200 Hz) with partials up to the 12th. Low-numbered partials were removed step-by-step, so that the remaining higher-numbered partials would have a better chance of showing any effect.

View Article and Find Full Text PDF

Psychophysical reverse-correlation methods such as the "classification image" technique provide a unique tool to uncover the internal representations and decision strategies of individual participants in perceptual tasks. Over the past 30 years, these techniques have gained increasing popularity among both visual and auditory psychophysicists. However, thus far, principled applications of the psychophysical reverse-correlation approach have been almost exclusively limited to two-alternative decision (detection or discrimination) tasks.

View Article and Find Full Text PDF

A major concern when designing a psychophysical experiment is that participants may use a stimulus feature (cue) other than that intended by the experimenter. One way to avoid this problem is to apply random variations to the corresponding feature across stimulus presentations to make the unwanted cue unreliable. An important question facing experimenters who use this randomization (roving) technique is how large the randomization range should be to ensure that the participants cannot achieve a certain proportion correct by using the unwanted cue, while at the same time avoiding unnecessary interference of the randomization with task performance.

View Article and Find Full Text PDF

When a random rove is used in a perceptual task to control the influence of an unwanted cue that may confound the decision strategy of primary interest, the effectiveness of the rove is determined by its range. Green [Profile Analysis (Oxford University Press, Oxford, 1988)] provided a formula which allows experimenters to determine the roving range required to ensure that the listeners relying on the unwanted cue cannot exceed a pre-defined percentage of correct responses in a two-interval, two-alternative forced-choice experiment. Here, Green's analysis is extended to the yes-no and m-alternative, forced-choice paradigms (m>2).

View Article and Find Full Text PDF

The equal-variance Gaussian signal detection theory (SDT) decision model for the dual-pair (4IAX) change-detection paradigm has been described in earlier publications. In this research article, we consider the equal-variance Gaussian SDT model for the related 4IAX AB-versus-BA identification paradigm. The likelihood ratios, optimal decision rules, receiver-operating characteristics (ROCs), and relationships between d' and proportion correct (PC) are analyzed for two special cases: that of statistically independent observations, which typically applies in constant-stimuli experiments, and that of highly correlated observations, which typically applies in experiments where stimuli are roved widely across trials or pairs.

View Article and Find Full Text PDF

In certain perceptual discrimination tasks, a change in a particular stimulus variable can be perceived as changes along multiple perceptual dimensions. If the study is primarily concerned with a particular perceptual dimension or cue, it is important that the experimenter keep the influences of the other unwanted but correlated perceptual cues under control. One way to accomplish this objective is to randomize the stimuli along the stimulus dimensions primarily associated with these unwanted cues, making them unreliable as a basis for the discrimination.

View Article and Find Full Text PDF

According to a well-known theorem in psychophysics (Green & Swets, 1966), the area under the receiver operating characteristic (ROC) for the yes-no paradigm equals the proportion of correct responses of an unbiased observer in the two-interval, two-alternative, forced choice paradigm (2I2AFC). Here, we demonstrate a similar relationship between the ROC area in the two-interval same-different (AX or 2IAX) paradigm, and the proportion correct in the four-interval same-different (4IAX, also known as dual-pair comparison) paradigm. The theorem demonstrated here is general, in the sense that it does not require that the sensory observations have a specific distribution (e.

View Article and Find Full Text PDF