k-Shape clustering for extracting macro-patterns in intracranial pressure signals

Martinez-Tejada, Isabel; Riedel, Casper Schwartz; Juhler, Marianne; Andresen, Morten; Wilhjelm, Jens E.

doi:10.1186/s12987-022-00311-5

Research
Open access
Published: 05 February 2022

Isabel Martinez-Tejada ORCID: orcid.org/0000-0003-4717-8597^1,2,
Casper Schwartz Riedel¹,
Marianne Juhler¹,
Morten Andresen¹ &
…
Jens E. Wilhjelm²

Fluids and Barriers of the CNS volume 19, Article number: 12 (2022)

Abstract

Background

Intracranial pressure (ICP) monitoring is a core component of neurosurgical diagnostics. With the introduction of telemetric monitoring devices in recent years, ICP monitoring has become feasible in a broader clinical setting, including monitoring during full mobilization and at home, where a greater diversity of ICP waveforms is present. Identification of these variations, the so-called macro-patterns lasting seconds to minutes, emerges as a potential tool for better understanding the physiological underpinnings of patient symptoms.

Methods

We introduce a new methodology that serves as a foundation for future automatic macro-pattern identification in the ICP signal to comprehensively understand the appearance and distribution of these macro-patterns in the ICP signal and their clinical significance. Specifically, we describe an algorithm based on k-Shape clustering to build a standard library of such macro-patterns.

Results

In total, seven macro-patterns were extracted from the ICP signals. This macro-pattern library may be used as a basis for the classification of new ICP variation distributions based on clinical disease entities.

Conclusions

We provide the starting point for future researchers to use a computational approach to characterize ICP recordings from a wide cohort of disorders.

Introduction

Intracranial pressure (ICP) monitoring is a mainstay of neurosurgical diagnostics both for intensive care management in acute neurosurgical conditions [1] and for aiding diagnosis in conditions outside the intensive care unit (ICU) for milder degrees of disease such as hydrocephalus, normal pressure hydrocephalus (NPH), or idiopathic intracranial hypertension (IIH).

In the clinical setting, ICP is often interpreted purely as a number within a certain range. Yet, ICP signals are complex time series with wave patterns that go beyond a single number. Analysis of ICP waveforms, either on a subsecond beat-to-beat basis or as patterns over longer durations (the so-called macro-patterns), gives further insight into brain function [2]. Machine learning tools have the potential to identify these patterns faster and, more importantly, objectively, helping to characterize their appearance and distribution in a standardized fashion, in contrast to the current practice of primarily visual inspection by clinicians. Until now, most studies have employed these techniques to analyze the ICP in acute conditions. Mariak et al. used artificial neural networks (ANN) to extract global properties of the entire ICP time series to assess the severity of the clinical state in intensive care patients [3]. Hornero et al. analyzed the complexity of the ICP signal, estimated by approximate entropy (ApEn), to determine the presence of patterns in periods of acute elevations in ICP of pediatric patients in intensive care [4].

In the last decade, new telemetric ICP monitoring devices have become available, allowing easier access to ICP recordings that are representative of daily life conditions, compared to previous cable-based solutions [1, 5]. Thus, ICP can now be monitored in patients with milder degrees of disease, in categories such as hydrocephalus, normal pressure hydrocephalus, or idiopathic intracranial hypertension. The ICP signals recorded with these systems have sufficient clinical and technical quality to be analyzed as part of the ICP interpretation procedure carried out by neurosurgeons and other clinicians [6, 7, 8], but the increased monitoring period and signal diversity also mean that the analysis of ICP data becomes more demanding.

In this study, we explore the use of machine learning tools to extract macro-patterns from the ICP signal in a diverse cohort of patients with different disease entities. We introduce a new methodology based on k-Shape clustering as a basic building block for future day-to-day ICP evaluation and update of models on stored patient data. Given that telemetric ICP monitoring has allowed us to evaluate the patient’s ICP outside the hospital, our main context for considering new macro-patterns moves away from ICP monitoring exclusively in the neurointensive care setting, where ICP variations are more accentuated. Specifically, our approach aims to permit a more adequate description of the longer timescale ICP variations now seen in the broader clinical setting, including disease types like NPH or IIH. Our approach created a universal library of representative macro-patterns that can later be used to automatically segment each individual ICP signal into shorter sequences based on clinical input. We also developed a template matching framework to classify these shorter sequences, which we will refer to as ICP subsequences, into what we estimate to be clinically significant macro-patterns. Finally, we propose a possible visualization strategy to display the pattern-annotated ICP signal in a fashion that is clinically useful.

Methods

Our goal was to create a scalable library of a few macro-pattern templates to use for ICP subsequence classification. We used k-Shape clustering as a method to efficiently group together subsequences characterized by their shape similarity despite differences in amplitude, duration and alignment. We first describe our data selection and processing approach for artifact removal. Next, we discuss our k-Shape based clustering approach to construct the templates. Finally, we show how the stored library can be used to characterize new incoming ICP signals by reproducible macro-patterns. The components of the entire approach are illustrated in Fig. 1.

Data selection

We used a collection of eight randomly selected anonymized overnight monitoring sessions that belong to different subjects from our database in the Department of Neurosurgery, Rigshospitalet, Denmark. A commercially available cable ICP probe (Neurovent-P; Raumedic AG, Germany) was used for these measurements. The length of the sessions spanned from nine to 22 h, summing up to a total of 143 h. The sampling frequency of the recordings was 100 Hz. The dataset was made up of five monitoring sessions (five patients) for a total of 88 h, and an additional set of three monitoring sessions (three patients) for a total of 55 h. By adding the latter dataset, template matching results can provide an indication of whether the algorithm is general enough to cover subjects with different disease entities.

Data preprocessing

The ICP signal recorded is often contaminated by very high and sharp spikes, with unphysiologically high values. These artifacts mask the characteristic appearance of the signal, rendering accurate pattern recognition impossible. We used an Empirical Mode Decomposition (EMD) based method for spike removal [9].

EMD decomposes the signal into a set of intrinsic mode functions (IMFs, i.e., IMF$_1$, ..., IMF$_n$, ..., IMF$_N$). The first function of this set corresponds to the fastest oscillations, while the last one corresponds to the slowest ones. Therefore, the higher the IMF order, the lower its frequency content. The first IMFs, containing high-frequency oscillations, indicate the presence of artifacts. Because the spikes have band-limited waveforms, their dominant oscillations are found in a subset of consecutive IMFs. In our case, the location of unphysiologically high and rapid spikes aligned with the location of spike events in IMF$_{1}$ to IMF$_{4}$, so summing these four IMFs enhances spike episodes. The summation result reveals the peaks with dominant amplitude at the temporal location of the spike, and attenuates the effect of non-spike events. The term $g_r$ will be used to refer to the partially reconstructed signal calculated as the sum of the first to fourth IMFs.

To identify the peak events in $g_r$, an adaptive thresholding approach was implemented. Samples whose value in $g_r$ fell outside the bounded region [$-\eta _s$, $\eta _s$] were identified as spikes. The threshold was calculated as $\eta _s= \sigma \sqrt{2\cdot \log (L)}$, where $\sigma $ and L are the standard deviation (noise level) and number of samples of $g_r$, respectively. It is a universal threshold first proposed by Donoho and Johnstone [10] for determining a value above background noise. Identified spikes were then imputed with a moving average calculated over a sliding window of 10 s.
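The thresholding and imputation steps can be sketched as follows. This is a minimal illustration in Python, not the MATLAB implementation used in the study. It assumes that the IMFs have already been computed by some EMD routine, that $\log$ denotes the natural logarithm, that $\sigma$ is the population standard deviation of $g_r$, and that the moving average is centered on the spike and skips other spike samples. All function names are hypothetical.

```python
import math
import statistics


def partial_reconstruction(imfs, count=4):
    """g_r: sample-wise sum of the first `count` IMFs (IMF_1 to IMF_4 in the text)."""
    return [sum(samples) for samples in zip(*imfs[:count])]


def universal_threshold(g_r):
    """eta_s = sigma * sqrt(2 * ln(L)); sigma is assumed to be the population std."""
    sigma = statistics.pstdev(g_r)
    return sigma * math.sqrt(2.0 * math.log(len(g_r)))


def find_spikes(g_r):
    """Indices of samples of g_r that fall outside [-eta_s, eta_s]."""
    eta_s = universal_threshold(g_r)
    return [n for n, value in enumerate(g_r) if abs(value) > eta_s]


def impute_spikes(icp, spike_idx, fs=100, window_s=10.0):
    """Replace spike samples by the mean of the non-spike samples in a
    centered window of `window_s` seconds (assumed window placement)."""
    half = int(window_s * fs / 2)
    spikes = set(spike_idx)
    cleaned = list(icp)
    for n in spike_idx:
        lo, hi = max(0, n - half), min(len(icp), n + half + 1)
        neighbours = [icp[m] for m in range(lo, hi) if m not in spikes]
        if neighbours:  # leave the sample untouched if the window is all spikes
            cleaned[n] = sum(neighbours) / len(neighbours)
    return cleaned
```

Because $\sigma$ is estimated from $g_r$ itself, a recording with many large spikes raises its own threshold, which is one reason the exact estimator of the noise level is flagged as an assumption here.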

Template library creation

We implemented the algorithm in MATLAB (R2020b; The MathWorks, Inc., Natick, MA) on a platform with an Intel® Core i7 processor (clock speed 2.6 GHz) and 16 GB of RAM.

Segmentation

Time series segmentation plays an important role in data mining and refers to the tool for decomposing the signal into a discrete number of contiguous subsequences. The proposed algorithm for segmentation of the ICP signal can be broken down into four sequential steps, as seen in Fig. 2. The following section will cover the details regarding each of the steps.

ICP segmentation was applied to divide the signal into subsequences of duration varying from seconds to minutes. This poses the challenge of deciding the time location at which to anchor both the start and end points of each subsequence. To address this problem, we first smoothed the signal via a linear phase finite impulse response (FIR) lowpass filter. The filtered signal was only used in the segmentation step. The cut-off frequency ($F_{pass}$) was set to 0.02–0.05 Hz, depending on the degree of smoothing desired for the removal of cardiac and respiratory contributions in each subject. Other filter parameters were $F_{stop}$ = 0.05–0.1 Hz, $A_{pass}$ = 0.001 dB, $A_{stop}$ = 60 dB, and minimum order.

From the smoothed ICP signal, the major extrema were extracted (maxima and minima). Only the minima were used as the start and endpoints for each of the ICP subsequences. Because some minima were located very close, both in time and in amplitude, to a neighboring maximum, we implemented the following rule to identify suitable minima for the segmentation. If we suppose that the discrete ICP signal at this stage can be written as $g_n$, $n=1,2,...,N$, and indexed in time order $t_n$, $n=1,2,...,N$, we removed a minimum $g_i$ from the candidate boundary points if:

1) the time difference between the minimum $g_i$ and its neighboring maximum $g_j$ was smaller than a predefined value $\eta _{dur}$, between 0.5 and 2 min, i.e. $|t_j-t_i|<\eta _{dur}$, or
2) the magnitude difference between a minimum $g_i$ and its neighboring maximum $g_j$ was smaller than a predefined value $\eta _{mag}$, between 0.5 and 1.5, i.e. $|g_j-g_i|<\eta _{mag}$.

We can then define the segmented window (i.e., ICP subsequence) as g[i, j] with i and j corresponding to the discrete indices of the selected boundary points. An example of these steps is shown in Fig. 5A–C.
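The boundary selection rule can be illustrated with a short Python sketch. It is not the study's implementation. It assumes that the input is the already smoothed signal, that extrema are strict local extrema, and that the "neighboring maximum" of a minimum is the maximum closest to it in time, which the text does not specify. Function names are hypothetical.

```python
def local_extrema(g):
    """Return (minima, maxima) as index lists of strict local extrema of g."""
    minima, maxima = [], []
    for n in range(1, len(g) - 1):
        if g[n] < g[n - 1] and g[n] < g[n + 1]:
            minima.append(n)
        elif g[n] > g[n - 1] and g[n] > g[n + 1]:
            maxima.append(n)
    return minima, maxima


def select_boundaries(g, t, eta_dur, eta_mag):
    """Keep a minimum g_i only if its nearest maximum g_j (assumed neighbor)
    satisfies |t_j - t_i| >= eta_dur and |g_j - g_i| >= eta_mag."""
    minima, maxima = local_extrema(g)
    kept = []
    for i in minima:
        if not maxima:
            kept.append(i)
            continue
        j = min(maxima, key=lambda m: abs(t[m] - t[i]))
        too_close = abs(t[j] - t[i]) < eta_dur      # rule 1
        too_shallow = abs(g[j] - g[i]) < eta_mag    # rule 2
        if not (too_close or too_shallow):
            kept.append(i)
    return kept


def segment(signal, boundaries):
    """Cut signal into windows g[i, j] between consecutive boundary indices."""
    return [signal[i:j + 1] for i, j in zip(boundaries, boundaries[1:])]
```

For example, with time in minutes, `select_boundaries([5, 1, 5, 4.8, 5, 1, 5], [0, 1, 2, 3, 4, 5, 6], eta_dur=0.5, eta_mag=1.0)` discards the shallow dip at index 3 and keeps the minima at indices 1 and 5.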

Z-normalization

Z-normalization of the derived subsequences was required before clustering. As many recent studies [11, 12] suggest, this procedure is necessary for data mining algorithms to deal with scale and translation invariance to prioritize shape features over amplitude ones. By z-normalizing each subsequence we ensured that they were linearly transformed to have zero mean and standard deviation close to one:

$$\begin{aligned} z(g[i,j])= \frac{g[i,j]-\mu _{g[i,j]}}{\sigma _{g[i,j]}} \end{aligned}$$

(1)

where $\mu _{g[i,j]}$ and $\sigma _{g[i,j]}$ refer to the mean and standard deviation of the ICP subsequence g[i, j], respectively. For the sake of simplicity, we will refer to each z-normalized ICP subsequence z(g[i, j]) as $z_{icp}$ in the rest of the paper.
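Eq. 1 can be written as a short Python sketch. It assumes the population standard deviation and returns a zero vector for a perfectly flat subsequence, a case the text does not discuss. The function name is hypothetical.

```python
import statistics


def z_normalize(subsequence):
    """Eq. 1: subtract the mean and divide by the standard deviation."""
    mu = statistics.fmean(subsequence)
    sigma = statistics.pstdev(subsequence)  # population std (assumption)
    if sigma == 0:
        return [0.0] * len(subsequence)     # flat window: shape is undefined
    return [(value - mu) / sigma for value in subsequence]
```

Two subsequences that differ only by an offset and a positive scale factor, such as `[1, 2, 3]` and `[10, 12, 14]`, map to the same normalized sequence, which is the scale and translation invariance referred to above.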

k-Shape clustering

k-Shape was used to divide our extracted ICP subsequences into a number of characteristic-preserving groups, the so-called clusters, such that sequences in the same group were similar in shape. Each cluster is represented by a central vector, the centroid, which is not necessarily part of the original dataset [13]. Each centroid in k-Shape is determined as a sequence that minimizes the sum of squared distances to the rest of the z-normalized ICP subsequences. This novel centroid-based clustering algorithm is fundamentally a variant of k-means with a distance measure derived from the cross-correlation coefficient [14]. As a result, one template is built for each centroid and subsequently stored together with a class label.

Through an iterative procedure, k-Shape:

1) assigned each z-normalized ICP subsequence to the centroid with the maximum shape similarity in the assignment step, and
2) updated the centroids based on the new members of each cluster, in the refinement step.

The previous two steps of the algorithm were repeated either until there was no change in cluster configuration or until the maximum number of 100 iterations was reached [14].

Shape similarity was defined by the so-called Shape-Based Distance (SBD):

$$\begin{aligned} SBD(\overrightarrow{x},\overrightarrow{c_k}) = 1-\max _w \left( \frac{CC_w(\overrightarrow{x},\overrightarrow{c_k})}{\sqrt{R_0(\overrightarrow{x},\overrightarrow{x})\cdot R_0(\overrightarrow{c_k},\overrightarrow{c_k})}} \right) \end{aligned}$$

(2)

where w is the shift at which the cross-correlation $CC_w(\overrightarrow{x},\overrightarrow{c_k})$ between the z-normalized ICP subsequence ($\overrightarrow{x}=z_{icp}$) and the centroid vector of each cluster ($\overrightarrow{c_k}$) is maximized, and the denominator is the geometric mean of the zero-lag autocorrelations $R_0$ of the individual sequences $\overrightarrow{x}$ and $\overrightarrow{c_k}$ [14]. Cross-correlation measures the degree of similarity between two time series, which in our case are $\overrightarrow{x}$ and $\overrightarrow{c_k}$, calculated as a function of the displacement of $\overrightarrow{x}$ over $\overrightarrow{c_k}$. Cross-correlation adds shift-invariance to the SBD measure and can be computed on sequences of different lengths.
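Eq. 2 can be sketched directly in Python. The version below evaluates the cross-correlation by brute force over every shift, which is quadratic in the sequence length; it is meant only to make the definition concrete and is not the study's implementation. It assumes zero padding outside the overlap, and the function names are hypothetical.

```python
import math


def cross_correlation(x, c, w):
    """CC_w: sum over m of x[m + w] * c[m], with zeros outside the overlap."""
    total = 0.0
    for m, c_m in enumerate(c):
        k = m + w
        if 0 <= k < len(x):
            total += x[k] * c_m
    return total


def sbd(x, c):
    """Shape-based distance of Eq. 2. Returns (distance, best_shift)."""
    # R_0(x, x) and R_0(c, c) are the zero-lag autocorrelations (sums of squares).
    norm = math.sqrt(sum(v * v for v in x) * sum(v * v for v in c))
    if norm == 0:
        return 1.0, 0  # a flat sequence carries no shape information
    shifts = range(-(len(c) - 1), len(x))
    best_shift = max(shifts, key=lambda w: cross_correlation(x, c, w))
    return 1.0 - cross_correlation(x, c, best_shift) / norm, best_shift
```

With this definition the distance lies between 0 and 2, and a shifted copy of a pattern has distance 0 to the original: `sbd([0, 0, 1, 2, 1, 0, 0], [1, 2, 1, 0, 0, 0, 0])` gives a distance of 0 at a shift of 2.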

Determining the optimal number of clusters, K, is a fundamental challenge within partitional clustering and unfortunately, there is not an ideal approach to identify K. Given that we had a large amount of data to be clustered into a number of clusters, and this number was dependent on medical practical experience, the need for an initial estimate of clusters is clear. We relied on a direct method, the so-called silhouette index, as the metric to evaluate the quality of the clustering structure. This metric evaluates the clustering quality based on the similarity between subsequences within the same cluster and across different clusters [15]:

$$\begin{aligned} S(l) = \frac{b(l)-a(l)}{\max \{a(l),b(l)\}} \end{aligned}$$

(3)

In Eq. 3, a(l) is the average distance between subsequence l and every subsequence within the same cluster and b(l) is the minimum average distance between subsequence l and every subsequence in different clusters [16]. The optimal estimate of K was the value that maximized the silhouette metric over a range of possible values for K. The window of solutions for which the silhouette index was calculated ranged from 5 to 20.
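As an illustration of Eq. 3, the sketch below computes the silhouette value of every subsequence from a precomputed pairwise distance matrix and a list of cluster labels. It is written in Python under two assumptions not stated in the text: the distance matrix would hold pairwise SBD values, and a subsequence that is alone in its cluster receives a silhouette of 0. Function names are hypothetical.

```python
def silhouette_values(dist, labels):
    """Per-subsequence silhouette S(l) from a distance matrix and labels."""
    clusters = {}
    for idx, lab in enumerate(labels):
        clusters.setdefault(lab, []).append(idx)
    values = []
    for l, lab in enumerate(labels):
        own = [m for m in clusters[lab] if m != l]
        if not own or len(clusters) < 2:
            values.append(0.0)  # singleton cluster or single cluster (assumption)
            continue
        a = sum(dist[l][m] for m in own) / len(own)
        b = min(
            sum(dist[l][m] for m in members) / len(members)
            for other, members in clusters.items()
            if other != lab
        )
        denom = max(a, b)
        values.append((b - a) / denom if denom > 0 else 0.0)
    return values


def mean_silhouette(dist, labels):
    """Average silhouette, the quantity maximized over candidate K."""
    values = silhouette_values(dist, labels)
    return sum(values) / len(values)
```

Selecting K then amounts to running the clustering once for each candidate value in the search range and keeping the K whose labels give the largest `mean_silhouette`.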

Cluster validation

Visual inspection of the clustering results is crucial for verifying the accuracy of the partitioning. However, a visual approach is subject to the level of expertise and subjectivity of the investigator. Thus, visualization needs to be combined with standardized cluster validation indices (CVI) tailored to quantitatively evaluate clustering results. Quantitative evaluation of extracted clusters is not straightforward if there is a lack of annotated data, so we need to rely on internal indices. Previous studies have shown that no single CVI is best in every context [17, 18]. Therefore, multiple validation indices were used in the validation process: Silhouette Index, Davies–Bouldin index (DBI), and Calinski–Harabasz index (CHI).

Silhouette index, introduced in the previous section, is a common metric to measure how well an object lies within a cluster and our selected internal clustering validation index. DBI is the ratio between the average distance of all subsequences of each cluster to their respective centroids and the distance of the centroids of the two clusters, i.e., the ratio between within-cluster compactness and between-cluster separation [19, 20]:

$$\begin{aligned} DBI = \frac{1}{K} \sum _{a=1}^K \max _{b\ne a} \Big \{ \frac{d_a+d_b}{d(c_a,c_b)} \Big \} \end{aligned}$$

(4)

where K is the number of clusters, a, b are cluster labels, $d_a,d_b$ the average distance of all subsequences in clusters a and b to their respective centroids, and $d(c_a,c_b)$ the distance between centroids. Smaller values indicate better clustering results, as clusters are more separated from each other and less dispersed internally. To be in line with the other CVIs, we use $1-DBI$ for comparison of clustering results, so that higher values indicate better clustering solutions.
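Eq. 4 translates into a few lines of Python. The sketch assumes that the per-cluster average distances $d_a$ and the centroid-to-centroid distances $d(c_a,c_b)$ have already been computed, for example with the SBD, which is an assumption about the distance used. The function name is hypothetical.

```python
def davies_bouldin(scatter, centroid_dist):
    """Eq. 4. scatter[a] is d_a; centroid_dist[a][b] is d(c_a, c_b)."""
    k = len(scatter)
    total = 0.0
    for a in range(k):
        # Worst-case (largest) compactness-to-separation ratio for cluster a.
        total += max(
            (scatter[a] + scatter[b]) / centroid_dist[a][b]
            for b in range(k)
            if b != a
        )
    return total / k


# Three hypothetical clusters: 0 is well separated, 1 and 2 lie close together.
scatter = [1.0, 1.0, 2.0]
centroid_dist = [[0, 10, 20], [10, 0, 5], [20, 5, 0]]
dbi = davies_bouldin(scatter, centroid_dist)
score = 1.0 - dbi  # the form used for comparison with the other CVIs
```

In this toy example clusters 1 and 2 dominate the index because each is the other's worst-case neighbor, which shows how a single pair of overlapping clusters can lower the score for the whole partition.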

CHI relates the within-cluster dispersion, calculated from the distances between each subsequence and the centroid of its cluster ($S_W$), to the between-cluster dispersion, calculated from the distances between each centroid and the global centroid ${\overline{c}}$ ($S_B$) [21]:

$$\begin{aligned} CHI = \frac{tr(S_B)}{tr(S_W)}\cdot \frac{n_p-1}{n_p-K} \end{aligned}$$

(5)

where $S_B$ and $S_W$ are the between and within cluster scatter matrices, respectively, tr the trace defined by the sum of the elements of the main diagonal of the scatter matrices, K the number of clusters and $n_p$ the number of clustered subsequences. The higher the index value, the better the performance of the clustering.

Characterization of ICP signals

Shape-based template matching

The primary goal was to learn the distinctive shapes that differentiate the pattern clusters from each other. Thus, when an uncharacterized ICP subsequence entered our system, we were able to automatically determine whether it matched a template from the library of patterns. To label ICP subsequences based on the generated templates, new ICP subsequences from the additional dataset were retrieved and z-normalized to achieve scaling invariance. To deal with horizontal shifts and stretching of the subsequence relative to the templates, we rescaled the time dimension. Query subsequences were then compared to each template to find the closest match. For this comparison, we computed the SBD so that the shape similarity could be measured.

This template matching approach assumes that every query is assigned to a template, even if the closest match shows a high SBD. For this reason, apart from defining our template library, we also defined a rule to ensure that the correlation to the closest match is meaningful. Although this parameter can be specified by the user, a reasonable rule is: $CC(z_{icp},\overrightarrow{c_k}) > 0.50$.
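The matching procedure with the rejection rule can be sketched as below. This Python illustration is not the study's implementation. It assumes that time rescaling is done by linear interpolation of the query to the template length, and that $CC$ in the rule is the maximum normalized cross-correlation, i.e. $1-SBD$. The template dictionary and all function names are hypothetical.

```python
import math
import statistics


def z_normalize(x):
    mu, sigma = statistics.fmean(x), statistics.pstdev(x)
    return [(v - mu) / sigma for v in x] if sigma > 0 else [0.0] * len(x)


def resample(x, length):
    """Linearly interpolate x onto `length` evenly spaced points (time rescaling)."""
    if len(x) == 1 or length == 1:
        return [x[0]] * length
    out = []
    for k in range(length):
        pos = k * (len(x) - 1) / (length - 1)
        lo = min(int(pos), len(x) - 2)
        frac = pos - lo
        out.append(x[lo] * (1.0 - frac) + x[lo + 1] * frac)
    return out


def max_ncc(x, c):
    """Maximum normalized cross-correlation over all shifts (equals 1 - SBD)."""
    norm = math.sqrt(sum(v * v for v in x) * sum(v * v for v in c))
    if norm == 0:
        return 0.0
    best = -math.inf
    for w in range(-(len(c) - 1), len(x)):
        cc = sum(x[m + w] * c[m] for m in range(len(c)) if 0 <= m + w < len(x))
        best = max(best, cc)
    return best / norm


def classify(query, templates, min_cc=0.50):
    """Return (label, cc) of the closest template, or (None, cc) if cc <= min_cc."""
    q = z_normalize(query)
    best_label, best_cc = None, -math.inf
    for label, template in templates.items():
        cc = max_ncc(resample(q, len(template)), template)
        if cc > best_cc:
            best_label, best_cc = label, cc
    return (best_label if best_cc > min_cc else None), best_cc
```

A query that is a stretched copy of a template is matched with a correlation close to 1, whereas a falling ramp compared against a rising-ramp template reaches only about 0.42 at its best shift and is therefore left unlabeled.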

Classification visualization

The amount of data in each ICP recording is very large. With our current template library, we are able to classify a subset of the ICP signal. This information must be presented to a clinical end-user in a fashion that is operationally useful. For this purpose, we represented each ICP subsequence as a colored box whose dimensions vary according to the characteristics of the subsequence (Fig. 3). The height of the box was defined by the difference between the absolute maximum and minimum values of the non-normalized sequence (of the raw unfiltered ICP signal), and the width by the duration of the sequence. The vertical center of the box corresponded to the median ICP value of the non-z-normalized subsequence. Each box was colored according to the label to which its subsequence had been matched, or black if the matching correlation coefficient was below 0.50.
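The box geometry described above can be computed as in the following Python sketch. It only derives the rectangle parameters and leaves the drawing to whatever plotting tool is used. The output field names, the `palette` mapping from labels to colors and the function name are hypothetical.

```python
import statistics


def subsequence_box(raw_icp, start, end, fs, label, cc, palette, min_cc=0.50):
    """Rectangle describing one ICP subsequence, from the raw unfiltered signal.

    start and end are sample indices (inclusive); fs is the sampling rate in Hz.
    """
    window = raw_icp[start:end + 1]
    height = max(window) - min(window)   # maximum minus minimum of the window
    center = statistics.median(window)   # vertical center of the box
    return {
        "x_start_s": start / fs,
        "width_s": (end - start) / fs,   # duration of the subsequence
        "y_bottom": center - height / 2.0,
        "height": height,
        "color": palette[label] if cc >= min_cc else "black",
    }
```

For a window with values `[10, 12, 18, 14, 11]` sampled at 1 Hz, the box is 8 units high, 4 s wide and centered on the median value of 12, so it extends from 8 to 16 on the pressure axis.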

Results

Data demographics

Eight patients were selected for the study: two male and six female. The pooled median age was 55 years (range: 20–74 years). Subjects were selected randomly from a continuously updated clinical ICP database. The clinical conditions were hydrocephalus, aneurysm and craniotomy, but signal analysis was performed on the anonymized recordings without reference to clinical information.

Data pre-processing

We decomposed the ICP signal via EMD into sixteen IMFs and a residual. Figure 4 shows an example of an ICP signal of one subject after EMD-based filtering, with unphysiologically high and rapid spikes removed.

On average, 18 spikes of less than one second duration were identified in each ICP recording (range: two to 43 spikes per recording), accounting on average for less than 0.000087$\%$ of the total monitoring time. Thus, removing the few samples corresponding to these spikes should not have any major consequences on later processing steps, especially since we will be looking at longer variations of the ICP signal.