Event-Related Potentials and Behavioral Responses to CV Stimuli Straddling Category Boundary

Ji Young Lee; Mark S. Hedrick; Ashley W. Harkrider

doi:10.12963/csd.19593

Commun Sci Disord > Volume 24(1); 2019 > Article

범주 경계 사이에 있는 말 자극에 대한 사건관련전위 및 변별 반응

Original Article

Commun Sci Disord 2019; 24(1): 129-140.

Published online: March 31, 2019

DOI: https://doi.org/10.12963/csd.19593

범주 경계 사이에 있는 말 자극에 대한 사건관련전위 및 변별 반응

LeeJi Young^a

, HedrickMark S.^b

, HarkriderAshley W.^b

^a대구가톨릭대학교 언어청각치료학과

^bUniversity of Tennessee Health Science Center

Event-Related Potentials and Behavioral Responses to CV Stimuli Straddling Category Boundary

Ji Young Lee^a

, Mark S. Hedrick^b

, Ashley W. Harkrider^b

^aDepartment of Audiology and Speech-Language Pathology, Daegu Catholic University, Gyeongsan, Korea

^bDepartment of Audiology and Speech Pathology, University of Tennessee Health Science Center, Knoxville, TN, USA

Correspondence: Ashley W. Harkrider, PhD Department of Audiology and Speech Pathology, University of Tennessee Health Science Center, 578 South Stadium Hall, UT Knoxville, TN 37996-0740, USA Tel: +1-865-974-1810 Fax: +1-865-974-1539 Email: aharkrid@uthsc.edu

Portions of the results from the present study were presented as a poster at the 2013 Meeting of International Evoked Response Audiometry Study Group, New Orleans, USA.

Received January 18, 2019 Revised February 21, 2019 Accepted March 6, 2019

This is an Open Access article distributed under the terms of the Creative Commons Attribution NonCommercial License (http://creativecommons.org/licenses/by-nc/4.0) which permits unrestricted noncommercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

초록

배경 및 목적

P300는 진폭이 크고 반복 측정되는 사건관련전위 중 하나이다. 본 연구의 목적은 범주 경계 사이에 있는 말 자극을 사용하였을 때에 P300가 자극의 이탈 정도와 두피 위치에 따라 다르게 나타나는지, 변별에 대한 행동 반응이 자극의 이탈 정도에 따라 다르게 나타나는지, P300와 행동 반응이 상관관계를 보이는지를 알아보는 것이다.

방법

정상청력을 가진 성인 20명으로부터 P300와 변별 반응에 대한 데이터를 아드볼 패러다임으로 측정하였다. 9단계의 /bi/-/pi/ 연속체에 대한 라벨링 검사 결과, 범주 내에 있는 1개의 CV를 표준자극으로, 범주 경계에 있는 2개의 CVs를 각각 약한 이탈 자극(WD)과 강한 이탈 자극(SD)으로 사용하였다.

결과

P300 진폭은 WD보다 SD에 대해 더 컸으며, 전두부보다 중앙부와 두정부에서 크게 나타났다. P300 잠복기는 WD보다 SD에 대해 더 짧았으며, 두피 위치에 따른 차이는 없었다. 변별 정확도는 WD보다 SD에서 더 높았고, 반응시간은 WD보다 SD에서 더 빨랐다. 그리고 반응시간과 P300 진폭은 음의 상관관계를, 반응시간과 P300 잠복기는 양의 상관관계를 보였다.

논의 및 결론

P300의 진폭과 잠복기는 미세한 음성학 변화에 대한 지각적 처리를 반영하며, 반응시간과 상관관계를 가진다.

Abstract

Objectives

The present study aims to examine in depth the nature and characteristics of the P300 response reflected in the processing of across-category speech stimuli by determining whether the P300 is different for the extent of deviancy of stimuli and a scalp distri-bution; the behavioral response is different for the extent of deviancy of stimuli; and the P300 is correlated with the behavioral response.

Methods

The P300 and behavioral data were measured in an oddball paradigm from twenty normal-hearing adults. From behavioral labeling test results of a nine-step series /bi/-/pi/ continuum, one within-category CV was selected as a standard stimulus and two across-category CVs were selected as weak (WD) and strong deviant stimuli (SD), respectively.

Results

The P300 amplitude was larger for SD than for WD and on central and parietal regions than frontal region while the P300 latency was shorter for SD than for WD with no difference for the scalp distribution. The accuracy response was greater and reaction times were faster for SD than for WD. And reaction times were negatively correlated with P300 amplitude and positively correlated with P300 latency.

Conclusion

The results suggest that the P300 amplitude and latency can reflect perceptual processing of subtle phonetic changes in across-category speech stimuli, and they are correlated with reaction times.

Keywords: Event related potentials, P300, Oddball paradigm, Speech processing, Categorical perception, Voice onset time

The human brain may perceive a gradual change of a sensory variable as discrete categories, not as continuous change. This is called categorical perception because a great amount of continuous sensory input is processed into a fixed abstract category rather than detailing the physical reality itself in the brain. Thus, two pairs of stimuli having the same absolute physical differences can be placed into the same or different categories by the category system represented in the brain. Also, the discrimination between stimuli can be better when they are across different categories than when they are within the same category. Categorical perception helps the brain process efficiently by picking up important information only, ignoring unnecessary information (Harnad, 1987, 2003; Liberman, Harris, Hoffman, & Griffith, 1957). Categoricallike perception is well known as an information process in human cognitive activity. Computational modeling has shown that both competitive networks and back-propagation systems show processes similar to categorical perception (Damper & Harnad, 2000). This pattern of information processing occurs on various sensory stimuli varying along a continuum (Bornstein & Korda, 1984; Etoff & Magee, 1992; Young et al., 1997). In particular, category perception of speech has been studied widely to examine the speech process since Liberman et al. (1957) revealed a synthetic /ba/-/da/- /ga/ continuum was perceived categorically, not continuously.

Studies accumulating neural data correlated with categorical perception have employed event related potentials (ERPs). ERPs refer to the electrical voltages elicited in the brain in response to some specific events or stimuli (Blackwood & Muir, 1990). ERPs have been one of the most powerful tools of examining cognitive processing of sensory information in the brain. The ERP components are defined by amplitude and latency. Amplitude is measured in microvolts by quantifying the maximum or minimum peak of voltage generated after the stimulus is presented, while the latency is obtained in milliseconds (ms) by measuring the time taken until the response is elicited after the stimulus is presented. ERPs— with the advantages of its high temporal resolution, noninvasive procedure, and relatively inexpensive cost— have contributed to the investigation of the changes of neural representation in the brain with the advantages of its high temporal resolution, noninvasive procedure, and relatively inexpensive cost. Thus, ERPs have been used in studying neural activity associated with a variety of speech and language processing tasks (Mueller, 2005). Behavioral tests of speech perception provide only the information about the final results of the whole speech perception processing, which cannot unveil the fundamental neural mechanism and subprocess underneath the end response. ERPs would be helpful in clarifying the observed speech and language outcomes by elucidating possible cortical neurophysiologic processing. Also, ERPs could be a viable option to evaluate speech perception for those who have difficulty in performing behavioral response due to physical or motor-skills problems or for those whose traditional behavioral performance is not enough to explain the outcomes in reality.

The P300 is one of the most robust and reliable peaks in the ERP repertoire (Patel & Azzam, 2005). Since the P300 was first described (Sutton, Braren, Zubin, & John, 1965), it has been widely used as a method of monitoring brain responses. Though the P300 is elicited at around 300 ms after stimulus onset, the latency can be recorded as late as 600 ms depending on the stimuli or events necessary performing some specific cognitive processes (Hall, 2007). The P300 is commonly elicited by an oddball paradigm in which a deviant stimulus with low probability of presentation is presented randomly in a series of a standard stimulus with high probability of presentation. The amplitude of P300 becomes greater as task demand or difficulty is reduced, while the latency of P300 becomes shorter as speed of information processing is increased (McPherson, Ballachanda, & Kaf, 2000; Polich, 2007). Whereas the P3a, an involuntary subcomponent of P300, is robust over the frontal or fronto-central scalp areas, the P3b, a voluntary response elicited by the oddball paradigm, has centro-parietal scalp topography (Bennington & Polich, 1999; Duncan et al., 2009). The current study follows the conventional referral of the P3b as the P300 response.

Numerous studies have suggested that the P300 can be a clinical tool to evaluate auditory processing, measuring the P300 in a variety of patients with hearing impairment, developmental dyslexia, language disorders, learning disability, attention deficit hyperactivity disorders (ADHD), Alzheimer's disease, schizophrenia, etc. (Alvarenga, Araújo, Ferraz, & Crenitte, 2013; Groenen, Beynon, Snik, & van den Broek, 2001; Hutchinson & McGill, 1997; Jeon & Polich, 2003; Purdy, Kelly, & Davies, 2002; Salamat & McPherson, 1999; Wible, Nicol, & Kraus, 2005). Groenen et al. (2001) reported that the prolonged P300 in cochlear implant users compared to people with normal hearing suggests that the P300 provides additional information about speech perception upon behavioral speech recognition testing. Alvarenga et al. (2013) examined the therapeutic effectiveness of a phonological remediation program, using the P300 and a phonological awareness test. Their results showed that the children with developmental dyslexia who received a phonological remediation program showed greater responses on the phonological awareness test and a shorter P300 latency compared to the control group with no phonological remediation program. In conjunction with behavioral methods, electrophysiological methods have contributed to the integrated understanding about of speech perception. Speech perception is a very dynamic and complex mechanism associated with auditory, linguistic, and cognitive processing. Collaborated approaches using both behavioral and electrophysiological measures could provide more detailed information about speech perception for differential diagnoses, monitoring, and rehabilitation of people with auditory processing problems (Horev, Most, & Pratt, 2007). In spite of the possibilities of clinical utility of the P300, however, the methods and procedures for the its clinical application have not yet been standardized. Thus, with the technical advances, further studies are required to get better insight into speech perception on the P300 (Duncan et al., 2009; Kalaiah & Shastri, 2016).

A lot of researchers have examined the extent of P300 sensitivity to perceptually deviant contrasts and the association of P300, by varying the physical features of auditory stimuli. Tone bursts have been most commonly used as stimuli to elicit the P300. Using 1,000 Hz standard stimuli and 2,000 Hz deviant stimuli, Purdi et al. (2002) showed that the P300 amplitude was reduced and the P300 latency was prolonged in people with learning disabilities. To study how the P300 and behavioral performance are associated, White, Stuart, & Najem (2010) constructed behaviorally perceptible tone contrasts and imperceptible tone contrasts. In a listening oddball paradigm, they reported that the P300 was elicited by behaviorally discriminable tone contrasts while no P300 was elicited by behaviorally imperceptible tone contrasts. Their results showed that the P300 is sensitive to stimulus probability and P300 has good correlation with behavioral performance when tone stimuli were used as stimuli. Kalaiah & Shastri (2016) studied P300 elicited at Fz, Cz, C3, C4, and Pz, using puretones and tones with frequency changes. They demonstrated that rising tone-complexes elicited the P300 with larger amplitude and shorter latency, possibly due to the larger magnitude of frequency change than falling tone-complexes and perceptual bias for approaching sounds as a warning cue in natural environment.

On the other hand, a variety of speech stimuli have been used to evoke the P300 elicited. Lew, Slimp, Price, & Massagli (1999) used the word of ‘mommy’ as the deviant stimulus and 1,000 Hz tone as the standard stimulus to examine the P300 evoked by speech stimuli. Their results showed that the speech-evoked P300 was larger than tone-evoked P300 in patients with traumatic brain injury. However, the comparison of tone and speech-evoked P300 responses needs to be interpreted cautiously because the word stimulus of ‘mommy’ is the very easily discriminated from the 1,000 tone and can elicit the larger P300, regardless of tone versus speech-evoked responses. While non-meaningful syllable stimuli have been widely used in the study of speech evoked P300 (Groenen et al., 2001; Oppitz et al., 2015), some investigators have examined how the P300 is evoked by sub-phonemic speech stimuli (Dalebout & Stack, 1999; Dehaene-Lambertz, 1997; Horev et al., 2007; Maiste, Wiens, Hunt, Scherg, & Picton, 1995; Tampas, Harkrider, & Hedrick, 2005). Maiste et al. (1995) constructed two sets of stimuli where each set had 1,000 stimuli from a nine-step /ba/-/da/continuum. For one set of stimuli, /ba/ endpoint stimuli and the other eight stimuli were presented by 52% and 48% (6% each), respectively. For the other set of stimuli, /da/ stimuli and the other eight stimuli were presented by 52% and 48% (6% each), respectively. The results showed that the N200-P300 complex had greater amplitude for more improbable stimuli, suggesting it was sensitive to reflect the phonetic level. Tampas et al. (2005) measured the mismatch negativity (MMN) and P300 with discrimination data using both synthetic speech contrasts and non-speech analogous acoustic contrasts as stimuli. Their results showed that MMN was elicited by non-speech stimuli only and that the P300 was elicited by both non-speech and speech stimuli, and electrophysiological data were associated with discrimination accuracy.

Based on the results of previous studies, the present study used consonant-vowel (CV) stimuli straddling the phoneme category boundary as stimuli to examine whether the P300 is sensitive to subtle phonetic changes. The present study replicates and extends the preliminary study (Lee, 2012) by doubling participants, measuring over wider scalp areas, and combining behavioral and ERP measures. Findings from this earlier study showed that the P300 was elicited by across-category stimuli, P300 amplitude was larger for greater deviancy of stimuli, P300 latency was not significant for the deviancy of stimuli, and P300 was larger over central and parietal regions than frontal region. Limitations of this study included a small number of subjects (N=10) as well as limited scalp recording sites (Fz, Cz, and Pz) and no behavioral data. In addition, there were no significant effects of P300 latency that were observed, contrary to earlier studies. The current study sought to confirm the stimulus-based results of the preliminary study while boosting the chance to observe latency effects by doubling the subject (N= 20). To determine whether scalp distribution was accurately assessed in the previous study, the current study tripled the recording sites from three to nine (including now F3, Fz, F4, C3, Cz, C4, P3, Pz, and P4). It is possible that the expanded recording sties may show a different response distribution, perhaps showing evidence of frontal lobe activity affecting or modulating the P300 response to speech contrasts. Finally, this study attempted to show how behavioral measures of accuracy response and reaction times occur for the deviancy of stimuli in discrimination task and might be related to P300 amplitude and latency measures. Thus, our aim in the current study was to examine in depth the nature and characteristics of the P300 response reflected in the processing of across-category speech stimuli by determining whether (1) the P300 is different for the extent of deviancy of stimuli and a scalp distribution, (2) the behavioral response is different for the extent of deviancy of stimuli, and (3) the P300 is correlated with the behavioral response.

METHODS

Subjects

Twenty native English speakers (male 8, female 12) served as subjects in the present study. The participants were ranged in age from 18 to 36 years old (mean= 23.3 years). All subjects were right-handed (Oldfield, 1971) and had no history of neurological or psychological disorders nor speech, language, or hearing disorders. Their audiometric thresholds were at or below 15 dB HL for the octave frequencies between 250 and 8,000 Hz (American National Standards Institute, 2004).

Stimuli

A nine-step series /bi/-/pi/ continuum was used as stimuli by manipulating voice onset time (VOT) from a male native English speaker's naturally-produced /bi/ syllable. The syllable was recorded in a quiet room, using a high quality microphone (Spher-O-Dyne) held approximately 5 cm the speaker's mouth, was saved as a file sampled at 44.1 kHz (CSRE version 4.5). This syllable was used as the prototypical /bi/ (Stimulus 1) with a VOT of 0 ms. To provide a more easily defined initial burst, and to aid in measuring VOT, the noise burst of a synthetic /pi/ was made and placed at the front of the /bi/ syllable. The noise burst portion increased in 8 ms steps so that the prototypical /pi/ (i.e., Stimulus 9) had a VOT of 64 ms (Adobe Audition version 1.5). Table 1 shows VOT of stimuli on a nine-step series /bi/-/pi/ continuum.

Table 1.

Voice onset time (VOT) of stimuli on a nine-step series/bi/-/pi/ continuum

	Stimulus
	1	2	3	4	5	6	7	8	9
VOT (ms)	0	8	16	24	32	40	48	56	64

From the results of labeling test on a /bi/-/pi/ continuum, one within-category CV and two across-category CVs were selected as stimuli. Stimulus 2 was used as the standard stimulus because it showed the greatest labeling performance. Stimulus 4 was used as weak deviant stimulus (WD) and Stimulus 5 as strong deviant stimulus (SD) because Stimulus 4 is was acoustically closer to the standard stimulus while Stimulus 5 is was acoustically farther from the standard stimulus. The details about determining the stimuli is described in Procedure section.

A total of 600 stimuli composed of 480 standard stimuli (80%) and 120 deviant stimuli (20%) with 60 for WD and 60 for SD were presented in an active oddball paradigm into two blocks. The presentation order of blocks was counterbalanced. Each block included 300 stimuli composed of 240 standard stimuli, 30 WD, and 30 SD. The stimuli were presented in pseudorandom sequences where the first deviant followed at least eight standard stimuli and no less than three standard stimuli continued after one deviant. Each stimulus was presented with a 1,100 ms inter-stimulus interval (ISI) to allow for measurement of response time. Subjects were given a 3-minute break upon the completion of each block.

Procedure

Prior to the experiment, all subjects signed informed consent statements, filled out a case history form, and received a hearing screening test.

Labeling test

To determine the speech stimuli in the oddball paradigm, a forced choice labeling test was administered prior to the EEG and discrimination test. For the labeling test, subjects were instructed to click the letter ‘b’ or ‘p’ on the computer screen by using a mouse. A total of 90 sounds were presented at a comfortable level (approximately 75 dB SPL) through headphones in a sound-treated booth. Each sound was presented 10 times in random order. The resulting averaged psychometric function from the labeling test was used to determine the standard stimulus, WD, and SD.

To analyze the mean /bi/ response, the percentage score of /bi/ response was converted by arcsine transformation. A one-way repeated measures analysis of variances (ANOVA) was conducted on the arcsine transformed /bi/ response with the deviancy condition (3 levels: standard, WD, SD) as the factor. Bonferroni corrections were used for any subsequent univariate testing.

The labeling performance was different for deviancy condition with large effect size (F(_2.911, _55.310) = 423.811, p< .001, ηp² =.957) (Cohen, 1992). The /bi/ response for Stimulus 2 was significantly greater than Stimulus 4 and Stimulus 5, and Stimulus 4 was significantly greater than Stimulus 5. Thus, Stimulus 4 was used as WD, and Stimulus 5 as SD for the experiment. Figure 1 shows the /bi/ response for the /bi/-/pi/ continuum, and Figure 2 shows the /bi/ response for Stimulus 2 (standard stimulus), Stimulus 4 (WD), and Stimulus 5 (SD).

Figure 1.

/bi/ response for the /bi/-/pi/ continuum. Error bas indicate 1 standard error from the mean.

Figure 2.

/bi/ response for Stimulus 2 (standard), Stimulus 4 (WD), and Stimulus 5 (SD). Error bas indicate 1 standard error from the mean. WD=weak deviant stimulus; SD=strong deviant stimulus.

EEG & discrimination test

Subjects were seated comfortably in a reclining chair in a dark sound-treated booth, and given the response button. They were instructed to press the response button as soon as they heard a stimulus that sounded different from the frequently presented stimuli. EEG data were recorded using Compumedics NeuroScan Scan 4.3.3 software and SynAmps 2 system while speech stimuli were presented and behavioral responses were recorded using Compumedics NeuroScan STIM2. To ensure subjects can detect the deviant stimulus, they took part in the experiment only when their performance reached 100% on the preliminary test. The stimuli were presented binaurally through Etymotic ER-3A insert earphones at approximately 75 dB SPL, which is consistent with the loudness levels of normal conversation.

While subjects were responding behaviorally, EEG responses were acquired from F3, Fz, F4, C3, Cz, C4, P3, Pz, and P4 according to International 10–20 System (Jasper, 1958). Figure 3 shows International 10–20 System. The electrodes were connected to a 64-channel NeuroScan system, referenced by the cap electrode between Pz and Pcz and grounded by the cap electrode at Fpz. Also, to measure vertical and horizontal electrooculograms (EOGs), the electrodes were placed above, below, and on the inner and outer canthi of the left eye. All electrodes were Ag/AgCl. During data collection, impedance was maintained at 5 kΩ. The continuous EEG recordings were obtained for 100 ms before and 1,000 ms after the stimulus onset, using a bandpass of 0.01–30 Hz and digitally sampled at 500 Hz. Only data associated with correct behavioral response, EOG less than 70 μ V, greater than 40 averaged responses in each deviancy condition were included for further analysis. An offline re-reference was administered. Grand average waveforms were derived from averaging all individual waveforms.

Figure 3.

International 10–20 system.

Data Analysis

P300 response

All post-stimulus data points were adjusted automatically relative to the average amplitude of the pre-stimulus baseline. The P300 amplitude and latency were measured on the deviant waveforms. Two two-way repeated measures ANOVAs were conducted on the P300 amplitude and latency, respectively, with the deviancy condition (2 levels: WD, SD) and scalp distribution (3 levels: frontal region, central region, parietal region) as the factors. For the analysis of the scalp distribution on the P300 amplitude and latency, the data from three electrodes on each scalp region were averaged. For example, the data for the frontal region was averaged from F3, Fz, and F4. When Mauchly's test of sphericity was significant, Greenhouse-Geisser adjusted degrees of freedom, F- and p-values were reported. On the use of multiple ANOVAs, Bonferroni adjustments were used such that an alpha level of .025 (.05/2) was considered significant. Bonferroni corrections were applied to all posthoc tests performed.

Behavioral response

For behavioral performance, accuracy response and reaction times were measured. Accuracy response was analyzed on the arcsine transformed score of the percentage of correct responses. Reaction times were analyzed on the time (ms) between the triggering point of the auditory stimuli and the moment participants pressed the response button. Two one-way repeated-measures ANOVAs were conducted on accuracy response and reaction times with context condition (2 levels: WD, SD) as the factor, respectively. On the use of multiple ANOVAs, Bonferroni corrections were applied such that an alpha level of p < .025 (.05/2) was considered. Bonferroni corrections were applied to all post-hoc tests performed.

Correlation between P300 and behavioral responses

The correlation between P300 and behavioral data was analyzed, using Pearson product-moment correlations between P300 amplitude, P300 latency, accuracy response, and reaction times. On the use of multiple bivariate correlations, Bonferroni corrections were used such that an alpha level of p < .017 (.0125/4) was considered.

RESULTS

P300 Response: Amplitude and Latency

A robust P300 response was elicited by both deviant stimuli (WD, SD) on the overall scalp distribution. The P300 amplitude was significantly different for deviancy condition (F(_1.,19) = 21.810, p < .001, ηp² =.534) and for scalp distribution (F(_{1.553,29.515)} = 77.345, p < .001, ηp² =.803), respectively (Cohen, 1992). The P300 amplitude was larger for SD than for WD and was larger over central and parietal regions than the frontal region. There was no significant interaction of deviancy condition and scalp distribution. Table 2 shows P300 amplitude (µV) by scalp distribution. The P300 grand average waveforms are presented in Figure 4.

Table 2.

P300 amplitude (μV) by scalp distribution

Scalp distribution	WD	SD
Frontal
F3	0.36 (3.52)	2.41 (4.13)
Fz	0.16 (3.64)	2.19 (4.37)
F4	1.09 (3.93)	2.98 (4.63)
Central
C3	4.64 (4.07)	6.89 (4.39)
Cz	6.43 (5.28)	8.36 (5.38)
C4	6.21 (4.16)	8.18 (4.26)
Parietal
P3	8.08 (4.38)	10.12 (4.23)
Pz	9.02 (5.33)	11.07 (4.91)
P4	7.62 (4.72)	9.11 (4.22)

Values are presented as mean (SD).

WD=weak deviant stimulus; SD = strong deviant stimulus.

Figure 4.

The P300 grand average waveforms. The P300 for the standard stimulus condition is dotted, the WD condition is thin, and the SD condition is bold line. WD=weak deviant stimulus; SD=strong deviant stimulus.

The P300 latency was significantly different for deviancy condition (F(_1.,19) = 5.924, p =.025, ηp² =.238) while it was not significantly different for the scalp distribution. The P300 latency was shorter for SD than for WD. There was no interaction of deviancy condition and scalp distribution. Table 3 shows P300 latency (ms) by scalp distribution.

Table 3.

P300 latency (ms) by scalp distribution

Scalp distribution	WD	SD
Frontal
F3	430.30 (78.31)	390.70 (65.02)
Fz	396.10 (68.48)	383.60 (55.80)
F4	397.10 (71.99)	387.40 (60.73)
Central
C3	408.50 (79.42)	379.50 (49.84)
Cz	412.30 (83.07)	380.60 (62.72)
C4	422.40 (90.07)	404.30 (70.35)
Parietal
P3	390.80 (78.32)	377.90 (56.51)
Pz	400.50 (86.34)	367.50 (55.64)
P4	395.90 (77.95)	375.20 (57.34)

Values are presented as mean (SD).

WD =weak deviant stimulus; SD = strong deviant stimulus.

Behavioral Response: Discrimination Accuracy Response and Reaction Times

For discrimination data, both accuracy response and reaction times were significantly different for the deviancy condition. Accuracy response was significantly greater for SD than WD (F(_1.,19) = 24.435, p < .001, ηp² =.563). Reaction time was significantly faster for SD than WD (F(_1.,19) = 21.052, p < .001, ηp² =.526). The results of accuracy response and reaction times are shown in Figures 5 and 6. Table 4 shows the mean and standard deviation of accuracy response and reaction times by individual.

Figure 5.

Discrimination accuracy response. Error bas indicate 1 standard error from the mean.

Figure 6.

Discrimination reaction times. Error bas indicate 1 standard error from the mean.

Table 4.

Discrimination accuracy response and reaction times by subject

Subject no.	Accuracy (%)		Reaction time (ms)
Subject no.	WD	SD	WD	SD
1	92	100	411.87	377.42
2	100	100	261.25	240.97
3	93	98	459.61	425.17
4	95	97	318.63	299.16
5	67	90	599.2	500.8
6	97	100	367.36	369
7	98	100	366.85	330.67
8	90	97	455.56	426.16
9	78	90	352.94	357
10	93	90	359.64	333.41
11	90	97	234.69	230.9
12	77	88	275.15	247.57
13	82	87	393.18	358.94
14	65	83	431.92	380.28
15	82	97	485.9	458.02
16	98	98	380.05	338.1
17	97	92	348.21	322.89
18	98	100	380.49	377.4
19	93	97	283.39	289.64
20	72	93	386.44	390.88
Mean (SD)	88 (11)	93 (5)	377.62 (84.89)	352.72 (70.57)

WD=weak deviant stimulus; SD= strong deviant stimulus.

Correlation between P300 and Behavioral Responses

P300 amplitude had significantly negative correlation with reaction times (r = −.471, p =.002) and P300 latency had significantly positive correlation with reaction times (r =.502, p =.001) while the other correlations were not significant. Table 5 shows the results of the bivariate correlation with Pearson correlation coefficient.

Table 5.

Bivariate correlation between P300, accuracy response, and reaction times

	P300 amplitude	P300 latency	Accuracy response	Reaction time
P300 amplitude	1.0	−.332	.221	−.471*
P300 latency		1.0	−.253	.502 *
Accuracy response			1.0	−.340
Reaction time				1.0

^* p<.017.

DISCUSSION & CONCLUSION

The P300 amplitude was sensitive to the degree of deviancy of across-category stimuli. The P300 was significantly larger for SD than for WD, suggesting that the P300 can reflect subtle perceptual processing at the sub-phonemic level. It may be because SD is perceptually more deviant from the standard stimulus than WD, and P300 is more sensitive to the more improbable stimuli. The P300 is very sensitive to stimulus probability and task difficulty (Folstein & Van Petten, 2008). The less demanding the task is, the greater the P300 amplitude (Polich, 2007). P30 amplitude is known to reflex neural changes in brain activity when new sensory input are presented, which is sensitive to the amount of central nervous system activity engaged with incoming information processing during a specific task (Johnson, 1993; Polich, 2004). Also, the P300 amplitude was significantly larger over the central and parietal regions than the frontal region, even with expanded recording sites. These findings were in line with previous findings in that it occurred maximally over the centro-parietal areas (Folstein & Van Petten, 2008; Martin, Tremblay, & Korczak, 2008; Picton, 1992).

The P300 latency was significantly shorter for SD than for WD, though it was not significant with smaller subjects (N=10) in the preliminary study. It is in line with some studies that reported longer P300 latency for more difficult tasks (Fitzgerald & Picton, 1983; Johnson & Donchin, 1980). This findings show that the shorter P300 latency for SD is associated with faster speed of information processing on easier tasks. P300 latency is different from the behavioral reaction times because the latency is a motor-free measure of the cortical nervous system (Duncan-Johnson & Kopelll, 1981). The P300 latency is known to index the speed for stimulus evaluation processes and the prolonged latency has been attributed to a lack of neural synchrony with increase in task difficulty (Henkin, Kishon-Rabin, Gadoth, & Pratt, 2002; Kalaiah & Shastri, 2016).

The P300 is engaged with conscious cognitive process and correlated with behavioral performance (Dalebout & Stack, 1999; White et al., 2010). In analogy with the results of P300 amplitude and latency, accuracy response was greater and reaction times were faster for SD than for WD, probably because discrimination of SD from the standard stimulus is associated with less processing demand compared to discrimination of WD from the standard stimulus. Using tone stimuli, White et al. (2010) showed all participants yielded P300 responses to perceptible contrasts with no response to the imperceptible contrasts. Also, there was significant correlation between P300 and reaction times. The results suggests that behavioral responses are related with the P300 amplitude and P300 latency, respectively, and reaction times is more sensitive to the association with the P300 rather than accuracy response. These correlational results suggest that the P300 more strongly reflects perceived certainty/uncertainty, with phonetic accuracy awaiting further refinement.

In summary, the P300 may reflect the subtle perceptual processing of phonetic changes, and the scalp distribution of the P300 is more dominant in the central and parietal regions versus the frontal region. Behavioral accuracy response was greater and reaction times were faster in less demanding tasks. Reaction times of discrimination were negatively correlated with the P300 amplitude, and positively correlated with the P300 latency.

REFERENCES

Alvarenga, KDF., Araújo, ES., Ferraz, É., & Crenitte, PAP. (2013). P300 auditory cognitive evoked potential as an indicator of therapeutical evolution in students with developmental dyslexia. CoDAS. 25(6):500–505.

American National Standards Institute. (2004). Specifications for audiometers (ANSI S3.6-2004). New York, NY: Author.

Bennington, JY., & Polich, J. (1999). Comparison of P300 from passive and active tasks for auditory and visual stimuli. International Journal of Psychophysiology. 34(2):171–177.

Blackwood, DHR., & Muir, WJ. (1990). Cognitive brain potentials and their application. The British Journal of Psychiatry. 157(S9):96–101.

Bornstein, MH., & Korda, NO. (1984). Discrimination and matching within and between hues measured by reaction times: some implications for categorical perception and levels of information processing. Psychological Research. 46(3):207–222.

Cohen, J. (1992). A power primer. Psychological Bulletin. 112(1):155–159.

Dalebout, SD., & Stack, JW. (1999). Mismatch negativity to acoustic differences not differentiated behaviorally. Journal of the American Academy of Audiology. 10(7):388–399.

Damper, RI., & Harnad, SR. (2000). Neural network models of categorical perception. Perception & Psychophysics. 62(4):843–867.

Dehaene-Lambertz, G. (1997). Electrophysiological correlates of categorical phoneme perception in adults. NeuroReport. 8(4):919–924.

Duncan, CC., Barry, RJ., Connolly, JF., Fischer, C., Michie, PT., Näätänen, R., & Van Petten, C. (2009). Event-related potentials in clinical research: guidelines for eliciting, recording, and quantifying mismatch negativity, P300, and N400. Clinical Neurophysiology. 120(11):1883–1908.

Duncan-Johnson, CC., & Kopell, BS. (1981). The Stroop effect: brain potentials localize the source of interference. Science. 214(4523):938–940.

Etcoff, NL., & Magee, JJ. (1992). Categorical perception of facial expressions. Cognition. 44(3):227–240.

Fitzgerald, PG., & Picton, TW. (1983). Event-related potentials recorded during the discrimination of improbable stimuli. Biological Psychology. 17(4):241–276.

Folstein, JR., & Van Petten, C. (2008). Influence of cognitive control and mismatch on the N2 component of the ERP: a review. Psychophysiology. 45(1):152–170.

Groenen, PA., Beynon, AJ., Snik, AF., & van den Broek, P. (2001). Speech-evoked cortical potentials recognition in cochlear implant users and speech. Scandinavian Audiology. 30(1):31–40.

Hall, JW. (2007). P300 response. New handbook for auditory evoked responses. (pp. 518–547). Boston, MA: Pearson.

Harnad, S. (1987). Psychophysical and cognitive aspects of categorical perception: a critical overview. Categorical perception: the groundwork of cognition. (pp. 1–25). New York, NY: Cambridge University Press.

Harnad, S. (2003). Categorical perception. Encyclopedia of cognitive science. (pp. 448–452). London: Nature Publishing Group/Macmillan.

Henkin, Y., Kishon-Rabin, L., Gadoth, N., & Pratt, H. (2002). Auditory event-related potentials during phonetic and semantic processing in children. Audiology and Neurotology. 7(4):228–239.

Horev, N., Most, T., & Pratt, H. (2007). Categorical perception of speech (VOT) and analogous non-speech (FOT) signals: behavioral and electrophysio-logical correlates. Ear and Hearing. 28(1):111–128.

Hutchinson, KM., & McGill, DJ. (1997). The efficacy of utilizing the P300 as a measure of auditory deprivation in monaurally aided profoundly hearing-impaired children. Scandinavian Audiology. 26(3):177–185.

Jasper, HH. (1958). The ten-twenty electrode system of the International Federation. Electroencephalography and Clinical Neurophysiology. 10, 370–375.

Jeon, YW., & Polich, J. (2003). Meta‐analysis of P300 and schizophrenia: patients, paradigms, and practical implications. Psychophysiology. 40(5):684–701.

Johnson Jr, R. (1993). On the neural generators of the P300 component of the event‐related potential. Psychophysiology. 30(1):90–97.

Johnson Jr, R., & Donchin, E. (1980). P300 and stimulus categorization: two plus one is not so different from one plus one. Psychophysiology. 17(2):167–178.

Kalaiah, MK., & Shastri, U. (2016). Cortical auditory event related potentials (P300) for frequency changing dynamic tones. Journal of Audiology & Otology. 20(1):22–30.

Lee, JY. (2012). Auditory P300 responses evoked by perceptually different between-category CV stimuli. Korean Journal of Communication Disorders. 17(3):499–507.

Lew, HL., Slimp, J., Price, R., & Massagli, TL. (1999). Comparison of speech-evoked v tone-evoked P300 response: implications for predicting outcomes in patients with traumatic brain injury. American Journal of Physical Medicine & Rehabilitation. 78(4):367–371.

Liberman, AM., Harris, KS., Hoffman, HS., & Griffith, BC. (1957). The discrimination of speech sounds within and across phoneme boundaries. Journal of Experimental Psychology. 54(5):358–368.

Maiste, AC., Wiens, AS., Hunt, MJ., Scherg, M., & Picton, TW. (1995). Event-related potentials and the categorical perception of speech sounds. Ear and Hearing. 16(1):68–90.

Martin, BA., Tremblay, KL., & Korczak, P. (2008). Speech evoked potentials: from the laboratory to the clinic. Ear and Hearing. 29(3):285–313.

McPherson, DL., Ballachanda, BB., & Kaf, W. (2000). Middle and long latency auditory evoked potentials. In RJ. Roeser (Ed.), Audiology diagnosis. (pp. 471–501). New York, NY: Thieme.

Mueller, JL. (2005). Electrophysio-logical correlates of second language processing. Second Language Research. 21(2):152–174.

Oldfield, RC. (1971). The assessment and analysis of handedness: the Edinburgh inventory. Neuropsychologia. 9(1):97–113.

Oppitz, SJ., Didoné, DD., Silva, DD., Gois, M., Folgearini, J., Ferreira, GC., & Garcia, MV. (2015). Long-latency auditory evoked potentials with verbal and nonverbal stimuli. Brazilian Journal of Otorhinolaryngology. 81(6):647–652.

Patel, SH., & Azzam, PN. (2005). Characterization of N200 and P300: selected studies of the event-related potential. International Journal of Medical Sciences. 2(4):147–154.

Picton, TW. (1992). The P300 wave of the human event-related potential. Journal of Clinical Neurophysiology. 9(4):456–479.

Polich, J. (2004). Clinical application of the P300 event-related brain potential. Physical Medicine and Rehabilitation Clinics in North America. 15(1):133–161.

Polich, J. (2007). Updating P300: an integrative theory of P3a and P3b. Clinical Neurophysiology. 118(10):2128–2148.

Purdy, SC., Kelly, AS., & Davies, MG. (2002). Auditory brainstem response, middle latency response, and late cortical evoked potentials in children with learning disabilities. Journal of the American Academy of Audiology. 13(7):367–382.

Salamat, MT., & McPherson, DL. (1999). Interactions among variables in the P300 response to a continuous performance task. Journal of the American Academy of Audiology. 10(7):379–387.

Sutton, S., Braren, M., Zubin, J., & John, ER. (1965). Evoked-potential correlates of stimulus uncertainty. Science. 150(3700):1187–1188.

Tampas, JW., Harkrider, AW., & Hedrick, MS. (2005). Neurophysiological indices of speech and non-speech stimulus processing. Journal of Speech, Language, and Hearing Research. 48(5):1147–1164.

White, L., Stuart, A., & Najem, F. (2010). Mismatch negativity and P300 to behaviorally perceptible and imperceptible temporal contrasts. Perceptual and Motor Skills. 110(3_Suppl):1105–1118.

Wible, B., Nicol, T., & Kraus, N. (2005). Correlation between brainstem and cortical auditory processes in normal and language-impaired children. Brain. 128(2):417–423.

Young, AW., Rowland, D., Calder, AJ., Etcoff, NL., Seth, A., & Perrett, DI. (1997). Facial expression megamix: tests of dimensional and category accounts of emotion recognition. Cognition. 63(3):271–313.