DADApy: Distance-based analysis of data-manifolds in Python

Patterns (N Y). 2022 Sep 19;3(10):100589. doi: 10.1016/j.patter.2022.100589. eCollection 2022 Oct 14.

Abstract

DADApy is a Python software package for analyzing and characterizing high-dimensional data manifolds. It provides methods for estimating the intrinsic dimension and the probability density, for performing density-based clustering, and for comparing different distance metrics. We review the main functionalities of the package and illustrate its use on a synthetic dataset and in a real-world application. DADApy is freely available under the open-source Apache 2.0 license.
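To make "distance-based" intrinsic dimension estimation concrete, the following is a minimal sketch of one well-known estimator of this kind, the two-nearest-neighbor (TwoNN) ratio estimator. It is an illustrative choice: the abstract does not say which estimators the package implements, and the code deliberately does not use the DADApy API. The function name `two_nn_intrinsic_dimension` and the synthetic dataset are hypothetical, and the brute-force neighbor search is only suitable for small datasets.

```python
import math
import random


def two_nn_intrinsic_dimension(points):
    """Estimate the intrinsic dimension from nearest-neighbor distance ratios.

    Hypothetical helper for illustration only; not part of DADApy.
    """
    log_ratios = []
    for i, p in enumerate(points):
        # Track the two smallest distances from point i to any other point.
        r1 = r2 = math.inf
        for j, q in enumerate(points):
            if i == j:
                continue
            dist = math.dist(p, q)
            if dist < r1:
                r1, r2 = dist, r1
            elif dist < r2:
                r2 = dist
        if r1 > 0.0:  # skip exact duplicates, whose ratio is undefined
            log_ratios.append(math.log(r2 / r1))
    # The ratio r2 / r1 is modeled as Pareto-distributed with exponent d,
    # so the maximum-likelihood estimate of d is N / sum(log(r2 / r1)).
    return len(log_ratios) / sum(log_ratios)


# Synthetic data (an assumption for this sketch): a two-dimensional plane
# embedded linearly in five dimensions, so the true intrinsic dimension is 2.
rng = random.Random(0)
points = []
for _ in range(400):
    u, v = rng.random(), rng.random()
    points.append((u, v, u + v, u - v, 0.5 * u))

estimated_id = two_nn_intrinsic_dimension(points)
print(f"estimated intrinsic dimension: {estimated_id:.2f}")
```

Although each point has five coordinates, the estimate should land near 2, because the estimator depends only on the distances between neighboring points and not on the number of coordinates used to describe them.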

Keywords: density estimation; density-based clustering; feature selection; intrinsic dimension; manifold analysis; metric learning.