Lecture Notes in Computer Science

Applying the String Method to Extract Bursting

bet	6/88
Sana	16.12.2017
Hajmi	12.42 Mb.
	#22381

1 2 3 4 5 6 7 8 9 ... 88

Applying the String Method to Extract Bursting

Information from Microelectrode Recordings in

Subthalamic Nucleus and Substantia Nigra

Pei-Kuang Chao

, Hsiao-Lung Chan

1,4

, Tony Wu

2,4

, Ming-An Lin

and Shih-Tseng Lee

3,4

Department of Electrical Engineering, Chang Gung University

Department of Neurology, Chang Gung Memorial Hospital

Department of Neurosurgery, Chang Gung Memorial Hospital

Center of Medical Augmented Virtual Reality, Chang Gung Memorial Hosipital

259 Wen-Hua First Road, Gui-Shan, 333, Taoyuan, Taiwan

peikuang@ms16.hinet.net

Abstract. This paper proposes that bursting characteristics can be effective pa-

rameters in classifying and identifying neural activities from subthalamic nu-

cleus (STN) and substantia nigra (SNr). The string method was performed to

quantify bursting patterns in microelectrode recordings into indexes. Inter-

spike-interval (ISI) was used as one of the independent variables to examine ef-

fectiveness and consistency of the method. The results show consistent findings

about bursting patterns in STN and SNr data across all ISI constraints. Neurons

in STN tend to release a larger number of bursts with fewer spikes in the bursts.

Neurons in SNr produce a smaller number of bursts with more spikes in the

bursts. According to our statistical evaluation, 50 and 80 ms are suggested as

the optimal ISI constraint to classify STN and SNr’s bursting patterns by the

string method.

Keywords: Subthalamic nucleus, substantia nigra, inter-spike-interval, burst,

microelectrode.

1 Introduction

Subthalamic nucleus (STN) is frequently the target to study and to treat Parkinson’s

disease [1, 2]. Placing a microelectrode to record neural activities in deep brain nuclei

provides useful information for localization during deep brain stimulation (DBS)

neurosurgery. DBS has been approved by FDA since 1998[3]. The surgery implants a

stimulator to deep brain nuclei, usually STN, to alleviate Parkinson’s symptoms, such

as tremor and rigidity. To search for STN in operation, a microelectrode probe is

often used to acquire neural signals from outer areas to the specific target. With assis-

tance of imagery techniques, microelectrode signals from different depth are read and

recorded. Then, an important step to determine STN location is to distinguish signals

of STN from its nearby areas, e.g. subtantia nigra (SNr) (which is a little ventral and

medial to STN). Therefore, characterizing and quantifying firing patterns of STN and

Applying the String Method to Extract Bursting Information

SNr are essential. Firing rate defined as the number of neural spikes within a period is

the most common variable used for describing neural activities. However, STN and

SNr have a broad range of firing rate and mostly overlapped [4] (although SNr has a

slightly higher mean firing rate than STN). This makes it difficult to depend on firing

rate to target STN.

Bursting patterns may provide a better solution to separate signals from different

nuclei. Bursting, defined as clusters of high-frequency spikes released by a neuron, is

believed storing important neural information. To establish long-term responses, cen-

tral synapses usually require groups of action potentials (bursts) [5,6]. Exploring

bursting information in neural activities has recently become fundamental in Parkin-

son’s studies [1,2]. Also, that spike arrays are more regular in SNr than in STN sig-

nals is observed [7]. However, the regularity of firing or grouping of spikes in STN

and SNr potentials has not been investigated thoroughly. This study aims to extract

bursting information from STN and SNr. A quantifying method for bursting, the

string method, will be applied. The string method quantifies bursting information

relying on inter-spike-interval (ISI) and spike number [8]. Although some other

methods for quantifying bursts exist [9,10], the string method is the one which can

provide information about what spike members contributing to the detected bursts. In

addition, because various ISIs have been used in research [8,9] to define bursts, this

study will also evaluate the effect of ISI constraints on discriminating STN and SNr

signals.

2 Method

The neuronal data used in this study were acquired during DBS neurosurgery in

Chang Gung Memorial Hospital. With assistance of imagery localization systems

[11], trials (10s for each) of microelectrode recordings were collected at a sampling

a

b

Fig. 1. MRI images from one patient: a. In the sagittal plane – the elevation angle of the probe

(yellow line) from the inter-commissural line (green line) was around 50 to 75

; b. In the fron-

tal plane – the angle of the probe (yellow lines) from the midline (green line) was about 8 to

to right or left.

P.-K. Chao et al.

rate of 24,000 Hz. Based on several observations, e.g. magnetic resonance imaging

(MRI), computed topography (CT), motion/perception-related responses and probe

location according to a stereotactic system, experienced neurologists diagnosed 18

trials as neural signals from STN, and the other 23 trials as from SNr. The trials which

were collected outside STN and SNr and/or confused between STN and SNr were

excluded. In this paper, the data are from 3 Parkinson’s patients (2 females, 1 male,

age=73.3

8.3 y/o) who received DBS treatment. Due to the patients’ individual dif-

ference, e.g. head size, the depth of STN from the scalp was found varied between 15

and 20 cm. During surgery, the elevation angle of the probe from the inter-

commissural line was around 50 to 75

(Fig 1a) and the angle between the probe and

the midline was about 8 to 18

toward either right or left (Fig 1b).

2.1 Spike Detection

Each trial of microelectrode recordings includes 2 types of signals, spikes and back-

ground signals. The background signals are interference from nearby neural areas or

environment. Because background signals can be interpreted as a Gaussian distribu-

tion, signals which are 3 standard deviations (SD) above or below mean can be treated

as non-background signals or spikes. Therefore, a threshold in the level of mean plus

3 SD is applied in this study to detect spikes (Fig 2).

5.175

5.18

5.185

5.19

5.195

5.2

5.205

5.21

100

time (s)

amplitude

Fig. 2. A segment of microelectrode recording – the red horizontal line is threshold; the green

stars indicate the found spikes; most signals around the baseline are background signals

Applying the String Method to Extract Bursting Information

51

2.2 The String Method

Every detected spike was plotted as a circle in a spike sequential number versus spike

occurring time figure (Fig 3). The spike sequential number is starting at 1 for the first

spike in a trial. The spikes which are closer to each other are labeled as strings [8] and

defined as bursts. Two parameters were controlled and manipulated to determine

bursts: (1) the minimum number of spikes to form a burst was 5; (2) the maximum ISI

of the adjacent spikes in a burst was set as 20ms, 50ms, 80ms, and 110ms separately

to find an optimal condition to distinguish STN and SNr bursting patterns.

5.5

6.5

100

120

140

160

180

time (s)

spike sequential number

Fig. 3. A segment of strings plot – each blue circle means a spike; the red triangles mean the

starting spikes of a burst; the black triangles mean the ending spikes of a burst

2.3 Dependent Variables

Three dependent variables were computed: (1) Firing rate (FR) was calculated as total

spike number divided by trial duration (10 s). (2) Number of bursts (NB) was deter-

mined by the string method as the total burst number in a trial. (3) The average of

spike number in each burst (SB) was also counted in every trial.

2.4 Statistical Analysis

Independent sample’s t-test was applied to test firing rate difference between STN

and SNr signals. MANOVA was performed to evaluate NB and SB separately among

different ISI constraints (

=.05).

P.-K. Chao et al.

3 Results

The signals from STN and SNr showed similar firing rate but different bursting pat-

terns. There is no significant difference between STN and SNr in firing rate (STN:

57.0

22.1; SNr: 68.8

23.5) (p>.05). The results of NB and SB are listed in Table 1

and Table 2. In NB, SNr has significantly fewer bursts than STN while ISI setting is

50ms and 80 ms (p<.05). In SB, SNr has significant more spikes in bursts than STN

while ISI setting is 20ms, 50ms and 80 ms (p<.05). Based on the findings, 2 points

can be addressed: First, STN and SNr have different bursting patterns, although their

total numbers of spikes (firing rate) are similar. Comparing to STN, SNr releases

fewer bursts but each burst contains more spikes. Second, setting ISI constraints

around 50~80 ms in the string method can be effective to distinguish the difference

between STN and SNr signals.

Table 1. NB results in SNr and STN

ISI constraint

SNr

STN

20ms

32.1

15.7 38.2

23.1

>.05

50ms

8.9

6.8 23.9

7.7

<.05*

80ms

5.2

5.8

10.7

7.5

<.05*

110ms

3.1

3.4

4.9

4.4

>.05

Table 2. SB results in SNr and STN

ISI constraint

SNr

STN

20ms

21.6

27.8 8.5

2.2

<.05*

50ms

202.5

309.9 28.1

24.1

<.05*

80ms

396.0

373.9

124.2

141.3

<.05*

110ms

515.1

367.6

337.7

342.8

>.05

4 Discussion and Conclusion

Microelectrode recordings from STN and SNr show valuable information in bursting

patterns which may be useful to assist neurosurgery in the future. From the results,

STN and SNr show very different patterns in bursting. Neurons in STN tend to release

more “small” bursts which contain fewer spikes. Neurons in SNr tend to produce

“giant” bursts which contain a big number of spikes. These bursting characteristics

are quantified into NB and SB which may assist in making decision about localizing

stimulation probes during DBS operations. Also, the simplicity of the string method

can offer quick information and be efficient in real-time analysis.

Applying the String Method to Extract Bursting Information

Different bursting characterisitcs in STN and SNr are revealed across all ISI con-

straints (20, 50, 80, 110 ms) in both variables, although statistical significance only

shows in 50 and 80ms. Because there is no “gold standard” to determine ISIs of adja-

cent spikes in bursts, several ISI constraint settings were tested in this study. No

matter in which setting, STN has a larger NB and smaller SB than what SNr has.

Statistical significance only shows in both dependent variables in 50 and 80 ms set-

tings. Therefore, we suggest the optimal ISI setting for identifying bursts in STN and

SNr signals should be around 50 and 80 ms.

For further studies, signals from other deep brain nuclei may be analyzed to enrich

the application of bursting information. Also, since the bioelectrical signals from deep

brain nuclei are non-stationary, non-linear methods, e.g. complexity, would be per-

formed and compared with current results to provide more nucleus-identifying clues

in the future.

Acknowledgments. The authors would like to express sincere appreciation to

the grant support from the Ministry of Economic Affairs in Taiwan, under contract

95-EC-17-A-19-S1-035.

References

Baufreton, J., Zhu, Z.-T., Garret, M., Bioulac, B., Johnson, S.W., Taupignon, A.I.: Dopamine

Receptors Set the Pattern of Activity Generated in Subthalamic Neurons. FASEB J. 19,

1771–1777 (2005)

Magarinos-Ascone, C.M., Figueiras-Mendez, R., Riva-Meana, C., Cordoba-Fernadez, A.:

Subthalamic Neuron Activity Related to Tremor and Movement in Parkinson’s Disease. Eur.

J. Neurosci. 12, 2597–2607 (2000)

Deushl, G., Volkmann, J., Krack, P.: Deep Brain Stimulation for Movement Disorders. Mov.

Disord. 17, S1-S1 (2002)

Sterio, D., Zonenshayn, M., Mogilner, A.Y., Rezai, A.R., Kiprovski, K., Kelly, P.J., Beric,

A.: Neurophysiological Refinement of Subthalamic Nucleus Targeting. Neurosurg 50, 58–69

(2002)

Zucker, R.S.: Frequency Dependent Changes in Excitatory Synaptic Efficacy. In: Dichter,

M.A. (ed.) Mechanisms of Epiletogenesis, pp. 153–157. Plenum Press, New York (1988)

Lisman, L.E.: Bursts as a Unit of Neural Information: Making Unreliable Synapses Reliable.

TINS 20, 38–43 (1997)

Benazzouz, A., Breit, S., Koudsie, A., Pollak, P., Krack, P., Benabid, A.L.: Intraoperative

Microrecordings of the Subthalamic Nucleus in Parkinson’s Disease. Mov. Disord. 17, S145-

S149 (2002)

Turnbull, L., Dian, E., Gross, G.: The String Method of Burst Identification in Neuronal

Spike Trains. J. Neurosci. Methods 145, 23–35 (2005)

Mulloney, B.: A Method to Measure the Strength of Multi-unit Bursts of Action Potentials. J.

Neurosci. Methods 146, 98–105 (2005)

10.

Favre, J., Taha, J.M., Baumann, T., Burchiel, K.J.: Computer Analysis of the Tonic, Phasic,

and Kinesthetic Activity of Pallidal Discharges in Parkinson Patients. Surg. Neurol. 51, 665–

673 (1999)

11.

Lee, J.-D., Huang, C.-H., Lee, S.-T.: Improving Stereotactic Surgery Using 3-D Reconstruc-

tion. IEEE Eng. Med. Biol. Mag. 21, 109–116 (2002)

Population Coding of Song Element Sequence

in the Songbird Brain Nucleus HVC

Jun Nishikawa

, Masato Okada

, and Kazuo Okanoya

RIKEN Brain Science Institute, 2-1 Hirosawa, Wako, Saitama 351-0198, Japan

Graduate School of Frontier Sciences, The University of Tokyo, 5-1-5 Kashiwanoha,

Kashiwa, Chiba 277-8561, Japan

Abstract. Birdsong is a complex vocalization composed of various song

elements organized according to sequential rules. To reveal the neural

representation of song element sequence, we recorded the neural re-

sponses to all possible element pairs of stimuli in the Bengalese ﬁnch

brain nucleus HVC. Our results show that each neuron has broad but

diﬀerential response properties to element sequences. We calculated the

time course of population activity vectors and mutual information be-

tween auditory stimuli and neural activities. The clusters of population

vectors responding to second elements had a large overlap, whereas the

clusters responding to ﬁrst elements were clearly divided. At the same

timing, confounded information also signiﬁcantly increased. These results

indicate that the song element sequence is encoded in a neural ensemble

in HVC via population coding.

Introduction

Songbirds have a complex learned vocalization composed of various song ele-

ments with a typical sequential rule. In Bengalese ﬁnches, these rules follow

individually distinctive ﬁnite state syntax [1]. Songbirds have been intensively

studied as a model for the syntactical properties of human language [2]. It is im-

portant to reveal the neural representation of complex song element sequences in

the songbird brain. Based on the ﬁnding of sequential selective neurons [3,4], it

has been thought that the song element sequence is encoded in a chain of the rigid

selective neurons [5]. Alternatively, it can be encoded in a neural ensemble of

relatively broadly selective neurons in a distributed manner [6,7]. We attempted

to determine which neural representation actually occurs in the songbird brain.

Songbirds have a specialized brain area for generating and learning complex

vocalizations, and the area is called as a song system. From the importance of

auditory feedback in song learning, numerous studies have investigated auditory

neural representation in the song system [3,4]. Especially, HVC is one of the ma-

jor sensory-motor integration sites in the song system; these neurons selectively

respond to the bird’s own song (BOS) in a time-locked manner. Margoliash and

Fortune found the neurons selectively responded to only typical element pair

stimuli included in their own song [3,4]. This type of neuron was named the

temporal combination selective neuron (TCS neuron). Since the discovery of

M. Ishikawa et al. (Eds.): ICONIP 2007, Part I, LNCS 4984, pp. 54–63, 2008.

c Springer-Verlag Berlin Heidelberg 2008

Population Coding of Song Element Sequence

TCS neurons, it has been thought that the song element sequence is encoded in

a series of diﬀerent types of TCS neurons. However, TCS neurons were found

only in a small portion of the recorded data. Many song-selective neurons lack

TCS properties. In addition, the stimuli used in these experiments were only

partial presentations of the entire sequence, such as EE, EF, FE, or FF within

ABCDEFGHI. This design did not test for responses to other element pairs,

such as AB, GC, or any other combination. To more fully understand the neural

representation of song element sequences, we must evaluate activity in response

to all possible song element pairs within ABCDEFGHI.

In this study, we recorded single-unit activities of HVC neurons driven by

all possible song element pair stimuli in anesthetized Bengalese ﬁnches. Then,

we used sequential response distribution analysis, population dynamics analysis,

and information-theoretic analysis to show that the song element sequence is

encoded within a neural ensemble in HVC neurons by population coding. These

ﬁndings led us postulate an alternative scheme for encoding the song element

sequence in the songbird HVC, with distributed neural representation rather

than the chain model of rigid selective TCS neurons.

Material and Methods

2.1

Animals

Twenty-three adult Bengalese ﬁnches (> 180 days post-hatch) were used in

this study. All experimental procedures were performed according to established

animal care protocols approved by the animal care and use committee at RIKEN.

2.2

Stimuli

Undirected songs were recorded in a quiet soundproof box using a microphone

and ampliﬁer connected to a computer with a sampling rate of 44.1 kHz and

16-bit resolution. We calculated sonograms from the recorded song using sound

analysis software (SASLab Pro; Avisoft, Berlin, Germany). A birdsong consists

of a series of discrete song elements with silent intervals among them. Song

elements were divided into distinct types by visual inspection of the spectro-

temporal structure of each sonogram. The transition matrix, representing the

transition probability between each song element, was then calculated. We can

evaluate the syntactical structure of song in individual Bengalese ﬁnches using

this transition matrix. We prepared ﬁve diﬀerent types of sound stimuli: BOS,

REV, OREV, element, and element pair. BOS is the forward playback of the

bird’s own song, while REV is the reversed playback of the same song. OREV is a

modiﬁed version of the song, in which the spectro-temporal composition of each

song element is retained, but the order of the song elements has been reversed.

Element stimuli are isolated playbacks of each song element. For example, if the

song has nine elements, element stimuli are A, B, C, and so on. Element pair

stimuli are combinations of all possible element pairs. In this case, we prepared

81 stimuli, including AA, AB,

· · ·, IH, and II.

J. Nishikawa, M. Okada, and K. Okanoya

2.3

Recording Procedure

Before electrophysiological recording sessions, birds were anesthetized with 4

to 7 doses of 10% urethane (40 μl per dose) at 20-min intervals. The birds

were restrained in a custom-made chamber on a stereotaxic device (David Kopf

Instruments, Tujunga, CA, USA). The birds were ﬁxed with ear-bars and a

beak-holder that positioned the beak tip at an angle of 45 degrees below the

horizontal plane. The head was treated with Xylocaine gel and the feathers

and skin were removed. A custom-made three-point ﬁxation device (Narishige,

Tokyo, Japan) was attached to the rostral part of the skull surface with den-

tal cement. Small holes were made in the skull just above the HVC. Finally, the

dura was removed, and tungsten electrodes were set on the surface. The ear-bars

were removed before making physiological recordings. The birds were located in

an electromagnetically shielded sound-attenuation box while in the stereotaxic

device. The electrodes were lowered into the brain using a hydraulic micro-

positioner (MODEL640, David Kopf Instruments), and extracellular signals from

HVC were recorded. The signals from the electrodes were ampliﬁed (gain 10,000)

and ﬁltered (100 Hz-10 kHz bandpass) using an extracellular recording ampliﬁer

(ER-91, Cygnus Technology, Water Gap, PA, USA). The data were digitized

at 20 kHz with 16-bit resolution using the data acquisition system (Micro1401,

Cambridge Electronic Design, Cambridge, UK) and the associated software

(Spike2, Cambridge Electronic Design). The data were stored in a computer

disk for oﬀ-line analysis. During the neural recording session, sound stimuli were

presented at a peak sound pressure of 70 dB. At ﬁrst, we presented BOS, REV,

OREV, and the silent stimuli. Next, each of the elements and silent stimuli were

delivered. Finally, we presented element pairs and silent stimuli. Each sound

stimulus was presented 20 times in a random order with an interstimulus interval

of 3 to 5 s. The computer for neural recording and that for stimulus presentation

were synchronized by a trigger-signal generated simultaneously with the stimuli.

2.4

Data Analysis

Analyses were performed using custom-made programs written by MATLAB

(Mathworks, Natick, MA, USA). The mean spontaneous ﬁring rate was cal-

culated from the baseline activity registered during the silent stimulus. The re-

sponse strength RS was calculated by subtracting the spontaneous rate from the

ﬁring rate R registered during stimulus presentation. R and RS were measured

in each 10 ms bin, from which we calculated the average R, RS, and variance σ

across the stimulus presentation period. To determine the selectivity of each

neuron, we calculated the psychophysical measure , as previously described [8].

d (x

) =

2(x

− x

)

+ σ

(1)

In this equation, x

is the response to stimulus A, and x

is the response to

stimulus B. d (x

) represents the response selectivity to stimulus A relative

to stimulus B based on the mean and variation of the responses. We considered

a neuron to be selective for a stimulus when the selectivity satisﬁes d > 1.0 [9].

Population Coding of Song Element Sequence

2.5

Population Dynamics Analysis

To analyze neural activity at the population level, we performed a population

dynamics analysis [10]. With our experimental design, we were not able to com-

bine the data from diﬀerent individuals because their songs and elements thereof

were completely diﬀerent from each other. Therefore, we presented the data for

one typical bird in which we could register activity from six distinct single units

throughout the presentation of all stimuli. Note that the qualitative property for

the obtained results was similar in the other birds. For each stimulus, we calcu-

lated a population activity vector, which is the set of instantaneous mean ﬁring

rates for each neuron in a 50 ms time window. In this case, each population

activity vector had six dimensions. Since this typical bird had four song ele-

ments, we calculated 16 population activity vectors for 16 element pair stimuli,

within the 50 ms time window. The time window shifted by increments of 1 ms

from -200 to 600 ms (stimulus onset = 0 ms). The data were smoothed using a

Gaussian ﬁlter with a variance of 10 ms. These procedures enabled us to observe

the temporal aspects of the neuronal population. The multidimensional scaling

method (MDS) [11] is a dimension-reduction method that rearranges data from

a high-dimensional space into a lower-dimensional space, while preserving as

much of the information as possible. MDS was applied to the set of population

activity vectors for each time window. Finally, the population response to each

stimulus was represented in two-dimensional MDS space, and the clustering of

these responses was analyzed.

2.6

Information-Theoretic Analysis

To evaluate how much information is transmitted by each neuron, we calcu-

lated the mutual information between the stimulus and the neural response [12].

Mutual information was quantiﬁed as the decrease in entropy of the stimulus

occurrence:

I(S; R) = H(S) − H(S|R),

(2)

−

p(s) log p(s) − −

p(s|r) log p(s|r)

In this equation, S is the set of stimuli s, and R is the set of neural responses

r, i.e., spike count. p(s|r)is the conditional probability of stimulus s given an

observed spike count r , and p(s) is the a priori probability of stimulus s. The

brackets indicate an average of the signal distribution p(r). To examine the time

course of the information, the response was evaluated using a 50 ms sliding

window. The center of the window was moved in 10 ms steps, beginning 200

ms before the stimulus onset and lasting until 600 ms after the stimulus. To

test the statistical signiﬁcance, we estimated the mean and standard deviation

of the information during the 200 ms period before the stimulus onset. If the

value exceeded the mean + 3SD, we considered the information signiﬁcant (P <

0.001).

J. Nishikawa, M. Okada, and K. Okanoya

Results

3.1

Selective Auditory Response to BOS

We used a spike sorting procedure to classify the signals recorded from the

Bengalese ﬁnch HVC, which yielded well identiﬁed single units (n = 104, 23

birds). We analyzed these data using the psychophysical measure d . In total,

86% of HVC neurons selectively responded to BOS compared to the silent con-

dition (d (RS

BOS

/RS

Baseline

) > 1.0, 89/104 cells). In addition, 63% of neu-

rons were more responsive to BOS than to REV (d (RS

BOS

/RS

REV

) > 1.0,

65/104 cells), and 28% of neurons were more responsive to BOS than to OREV

(d (RS

BOS

/RS

OREV

) > 1.0, 29/104 cells). These results are consistent with

past studies [4]. The mean of d (RS

BOS

/RS

REV

) was 1.25, and the mean of

d (RS

BOS

/RS

OREV

) was 0.63. These results indicate that BOS-selective neu-

rons in HVC are largely variable, especially in terms of sequential response

properties.

A. Elements

-0.2

0.5 (s)

100

ret

B. Element pairs

-0.2

0.5 (s)

100

ret

C A

C B

C C

C D

D A

D B

D C

D D

A A

A B

A C

A D

B A

B B

B C

B D

Fig. 1. An example of the auditory response to song elements (A) and element pairs

(B) in a single unit from HVC

Population Coding of Song Element Sequence

Fig. 2. Song transition matrices of self-generated songs (ﬁrst row) and sequential re-

sponse distribution matrices of each single unit (second to seventh rows)

3.2

Responses to Song Element Pair Stimuli

To investigate the neural selectivity to song element sequences, we recorded neu-

ral responses to all possible element pair stimuli. Because the playback of these

stimuli is extremely time-consuming, we could only maintain 34% of the recorded

single units stable throughout the entire presentation (35/104 cells, 12/23 birds).

In total, 70% of the stable single-units were BOS-selective (d (RS

BOS

/RS

REV

)

> 1.0, 27/35 cells, 12/12 birds). Thereafter, we focused on these data.

A typical example of neural responses to each song element is shown in

Fig. 1 (A). The neuron responded to a single element A or C with single pha-

sic activity, but it did not respond to element B. It responded to element D with

double phasic activity. These results indicate that the neuron has various response

properties even during single element presentation. In addition, the neuron exhib-

ited more complex response properties during the presentation of element pairs

(Fig. 1(B)). The neuron responded more strongly to most of the element pairs

when the second element was A or C, compared to single presentation of each ele-

ment. However, the response was weaker when the ﬁrst and second elements were

the same. When the second element was B, no diﬀerences were observed between

single and paired stimuli. When the second element was D, we measured single

J. Nishikawa, M. Okada, and K. Okanoya

phasic responses, and a strong response to BD. These response properties were

not correlated with the element-to-element transition probabilities in the song

structure. The dotted boxes indicate the sequences included in BOS. However,

the neuron responded only weakly to some sequences that were included in BOS

(brack arrows). In contrast, the neuron responded strongly to other sequences that

were not included in BOS (white arrows). Thus, the neuron had broad response

properties to song element pairs beyond the structure of self-generated song.

To quantitatively evaluate sequential response properties, we calculated the

response strength measure d (RS

/RS

Baseline

) to the element pair stimuli S.

The sequential response distributions were created for each neuron in two indi-

viduals with more than ﬁve well identiﬁed single units. Song transition matrices

and sequential response distributions are shown in Fig. 2. The response distribu-

tions were not correlated with the associated song transition matrices. However,

each HVC neuron in the same individual had broad but diﬀerent response distri-

bution properties. This tendency was consistent among individuals. This result

indicates that the song element sequence is encoded at the population level,

within broadly but diﬀerentially selective HVC neurons.

3.3

Population Dynamics Analysis

To analyze the information coding of song element sequences at the population

level, we calculated the time course of population activity vectors, which is the

set of instantaneous mean ﬁring rates for each neuron in a 50 ms time win-

dow. Snapshots of population responses to stimuli are shown in eight panels of

Fig. 3A (n = 6, bird 2 of Fig. 2). Each point in the panel represents the popu-

lation vector toward each stimulus on the MDS space. The ellipses in the upper

four panels indicate the group of vectors whose stimuli have the same ﬁrst el-

ement, while the ellipses in the lower four panels indicate the group of vectors

whose stimuli have the same second element. Note that the population activ-

ity vectors in the upper four panels are identical to those in the bottom four

panels, and only the ellipses diﬀer. Before the stimulus presentation ([-155 ms:

-105 ms], upper and lower panels), only spontaneous activities were observed

around the origin. After the ﬁrst element presentation ([50 ms: 100 ms], upper

panel), groups with the same ﬁrst elements split. After the second element pre-

sentation ([131 ms: 181 ms] of the lower panel), groups with the same second

elements were still largely overlapping. In the next section, we will show that

confounded information, which represents the relation between ﬁrst and second

elements, increased signiﬁcantly in this timing. After suﬃcient time ([480 ms:

530 ms], upper and lower panels), the neurons returned to spontaneous activity.

The result indicates that the population response to the ﬁrst and second element

is drastically diﬀerent. Subsequently, we will show that this overlap is derived

from the information in the song element sequence.

3.4

Information-Theoretic Analysis

To determine the origin of the overlap in the population response, we calculated

the time course of mutual information between the stimulus and neural activity.

Population Coding of Song Element Sequence

Fig. 3. Responses of HVC neurons at the population level (A) and encoded informa-

tion (B)

The mutual information for ﬁrst elements I(S

; R), the second elements I(S

; R),

and that of element pairs I(S

, S

; R) was calculated within each time window;

the window was shifted to analyze the temporal dynamics of information coding

(left upper 3 graphs in Fig. 3B). Narrow lines in each graph indicate the cumula-

tive trace of mutual information in each neuron. The thick line is the cumulative

trace of all neurons in the individual. The bottom-left graph in Fig. 3B shows the

probability of stimulus presentation. After the presentation of the ﬁrst elements,

mutual information for the ﬁrst elements increased, showing a statistically sig-

niﬁcant peak (P < 0.001). After the presentation of the second elements, mutual

information for the second elements signiﬁcantly increased (P < 0.001). At the

J. Nishikawa, M. Okada, and K. Okanoya

same time, mutual information for element pairs also showed a signiﬁcant peak

(P < 0.001). Intuitively, information for element pairs I(S

, S

; R) would con-

sist of information for the ﬁrst elements I(S

; R) and second elements I(S

; R).

However, the consecutive calculation of I(S

, S

; R) − I(S

; R) in

each time window causes a statistical peak after the presentation of element

pairs (P < 0.001; forth graph from left in Fig. 3B). The diﬀerence C represents

the conditional mutual information between the ﬁrst and second elements for a

given neural response, otherwise known as confounded information [13]. There-

fore, confounded information represents the relationship between the ﬁrst and

second elements encoded in the neural responses. The I(S

; R) peak occurred

at the same time that groups of population vectors with the same ﬁrst elements

were splitting ([50 ms: 100 ms]). The peaks for I(S

; R), I(S

, S

; R), and C

occurred during the same time that groups with the same second elements were

still largely overlapping ([131 ms: 181 ms]). This indicates that the sequential

information causes an overlap in the population response.

In the population dynamics analysis, we cannot combine the data from diﬀer-

ent birds because each bird has a diﬀerent number and types of song elements.

However, in the mutual information analysis, we can combine and average the

data from diﬀerent birds. The ﬁve graphs on the right in Fig. 3B show the time

courses for I(S

; R), I(S

, S

; R), C, and the stimulus presentation

probability, which were calculated from all stable single units with BOS selectiv-

ity (n = 27, 12 birds). The combined mutual information for ﬁrst elements was

very similar to that from one bird, showing a signiﬁcant peak after the presenta-

tion of the ﬁrst elements (P < 0.001). Mutual information for second elements,

element pairs, and confounded information also had signiﬁcant peaks after the

presentation of the second elements (P < 0.001). These results show that the

song element sequence is encoded into a neural ensemble in HVC by population

coding.

Conclusion

In this study, we recorded auditory responses to all possible element pair stimuli

from the Bengalese ﬁnch HVC. By determining the sequential response distribu-

tions for each neuron, we showed that each neuron in HVC has broad but diﬀer-

ential response properties to song element sequences. The population dynamics

analysis revealed that population activity vectors overlap after the presentation

of element pairs. Using mutual information analysis, we demonstrated that this

overlap in the population response is due to confounded information, namely,

the sequential information of song elements. These results indicate that the song

element sequence is encoded into the HVC microcircuit at the population level.

Song element sequences are encoded in a neural ensemble with broad and dif-

ferentially selective neuronal populations, rather than the chain-like model of

diﬀerential TCS neurons.

Population Coding of Song Element Sequence

Acknowledgment

This study was partially supported by the RIKEN Brain Science Institute, and

by a Grant-in-Aid for young scientists (B) No. 18700303 from the Japanese

Ministry of Education, Culture, Sports, Science, and Technology.

References

1. Okanoya, K.: The Bengalese ﬁnch: a window on the behavioral neurobiology of

birdsong syntax. Ann. N.Y. Acad. Sci. 1016, 724–735 (2004)

2. Doupe, A.J., Kuhl, P.K.: The Bengalese ﬁnch: a window on the behavioral neu-

robiology of birdsong syntax. Birdsong and human speech: common themes and

mechanisms. Annu. Rev. Neurosci. 22, 567–631 (1999)

3. Margoliash, D., Fortune, E.S.: Temporal and harmonic combination-selective neu-

rons in the zebra ﬁnch’ s HVc. J. Neurosci. 12, 4309–4326 (1992)

4. Lewicki, M.S., Arthur, B.J.: Hierarchical organization of auditory temporal context

sensitivity. J. Neurosci. 16, 6987–6998 (1996)

5. Drew, P.J., Abbott, L.F.: Model of song selectivity and sequence generation in area

HVc of the songbird. J. Neurophysiol. 89, 2697–2706 (2003)

6. Deneve, S., Latham, P.E., Pouget, A.: Reading population codes: a neural imple-

mentation of ideal observers. Nat. Neurosci. 2, 740–745 (2001)

7. Pouget, A., Dayan, P., Zemel, R.: Information processing with population codes.

Nat. Rev. Neurosci. 1, 125–132 (2000)

8. Green, D., Swets, J.: Signal Detection Theory and Psychophysics. Wiley, New York

(1966)

9. Theunissen, F.E., Doupe, A.J.: Temporal and spectral sensitivity of complex audi-

tory neurons in the nucleus HVc of male zebra ﬁnches. J. Neurosci. 18, 3786–3802

(1998)

10. Matsumoto, N., Okada, M., Sugase-Miyamoto, Y., Yamane, S., Kawano, K.: Popu-

lation dynamics of face-responsive neurons in the inferior temporal cortex. Cerebr.

Cort. 15, 1103–1112 (2005)

11. Gower, J.C.: Some distance properties of latent root and vector methods used in

multivariate analysis. Biometrika 53, 325–328 (1966)

12. Sugase, Y., Yamane, S., Ueno, S., Kawano, K.: Global and ﬁne information coded

by single neurons in the temporal visual cortex. Nature 400, 869–873 (1999)

13. Reich, D.S., Mechler, F., Victor, J.D.: Formal and attribute-speciﬁc information in

primary visual cortex. J. Neurophysiol. 85, 305–318 (2001)

M. Ishikawa et al. (Eds.): ICONIP 2007, Part I, LNCS 4984, pp. 64–72, 2008.

Download 12.42 Mb.

Do'stlaringiz bilan baham:

1 2 3 4 5 6 7 8 9 ... 88