Cluster tendency assessment in neuronal spike data

Sara Mahallati; James C Bezdek; Milos R Popovic; Taufik A Valiante

doi:10.1371/journal.pone.0224547

Cluster tendency assessment in neuronal spike data

PLoS One. 2019 Nov 12;14(11):e0224547. doi: 10.1371/journal.pone.0224547. eCollection 2019.

Authors

Sara Mahallati^{1

2

3

4}, James C Bezdek⁵, Milos R Popovic^{1

2

4}, Taufik A Valiante^{1

3

6

4}

Affiliations

¹ Institute of Biomaterials and Biomedical Engineering, University of Toronto, Toronto, Canada.
² KITE Research Institute, University Health Network, Toronto, Canada.
³ Krembil Research Institute, University Health Network, Toronto, Canada.
⁴ CRANIA, University Health Network and University of Toronto, Toronto, Canada.
⁵ Computer Science and Information Systems Departments, University of Melbourne, Melbourne, Australia.
⁶ Division of Neurosurgery, University of Toronto, Toronto, Canada.

Abstract

Sorting spikes from extracellular recording into clusters associated with distinct single units (putative neurons) is a fundamental step in analyzing neuronal populations. Such spike sorting is intrinsically unsupervised, as the number of neurons are not known a priori. Therefor, any spike sorting is an unsupervised learning problem that requires either of the two approaches: specification of a fixed value k for the number of clusters to seek, or generation of candidate partitions for several possible values of c, followed by selection of a best candidate based on various post-clustering validation criteria. In this paper, we investigate the first approach and evaluate the utility of several methods for providing lower dimensional visualization of the cluster structure and on subsequent spike clustering. We also introduce a visualization technique called improved visual assessment of cluster tendency (iVAT) to estimate possible cluster structures in data without the need for dimensionality reduction. Experimental results are conducted on two datasets with ground truth labels. In data with a relatively small number of clusters, iVAT is beneficial in estimating the number of clusters to inform the initialization of clustering algorithms. With larger numbers of clusters, iVAT gives a useful estimate of the coarse cluster structure but sometimes fails to indicate the presumptive number of clusters. We show that noise associated with recording extracellular neuronal potentials can disrupt computational clustering schemes, highlighting the benefit of probabilistic clustering models. Our results show that t-Distributed Stochastic Neighbor Embedding (t-SNE) provides representations of the data that yield more accurate visualization of potential cluster structure to inform the clustering stage. Moreover, The clusters obtained using t-SNE features were more reliable than the clusters obtained using the other methods, which indicates that t-SNE can potentially be used for both visualization and to extract features to be used by any clustering algorithm.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Action Potentials / physiology*
Cluster Analysis
Computer Simulation
Models, Neurological*
Neurons / physiology*
Pattern Recognition, Automated
Signal Processing, Computer-Assisted*

Grants and funding

This research was supported by the Natural Sciences and Engineering Research Council of Canada. MRP was funded by Dean Connor and Maris Uffelmann Donation, Canadian Fund For Innovation and Natural Sciences and Engineering Research Council: Discovery Grant (RGPIN-2016-06358).