Comparative Assessment of Multimodal Sensor Data Quality Collected Using Android and iOS Smartphones in Real-World Settings

Sensors (Basel). 2024 Sep 26;24(19):6246. doi: 10.3390/s24196246.

Abstract

Healthcare researchers are increasingly utilizing smartphone sensor data as a scalable and cost-effective approach to studying individualized health-related behaviors in real-world settings. However, to develop reliable and robust digital behavioral signatures that may help in the early prediction of the individualized disease trajectory and future prognosis, there is a critical need to quantify the potential variability that may be present in the underlying sensor data due to variations in the smartphone hardware and software used by large population. Using sensor data collected in real-world settings from 3000 participants' smartphones for up to 84 days, we compared differences in the completeness, correctness, and consistency of the three most common smartphone sensors-the accelerometer, gyroscope, and GPS- within and across Android and iOS devices. Our findings show considerable variation in sensor data quality within and across Android and iOS devices. Sensor data from iOS devices showed significantly lower levels of anomalous point density (APD) compared to Android across all sensors (p < 1 × 10-4). iOS devices showed a considerably lower missing data ratio (MDR) for the accelerometer compared to the GPS data (p < 1 × 10-4). Notably, the quality features derived from raw sensor data across devices alone could predict the device type (Android vs. iOS) with an up to 0.98 accuracy 95% CI [0.977, 0.982]. Such significant differences in sensor data quantity and quality gathered from iOS and Android platforms could lead to considerable variation in health-related inference derived from heterogenous consumer-owned smartphones. Our research highlights the importance of assessing, measuring, and adjusting for such critical differences in smartphone sensor-based assessments. Understanding the factors contributing to the variation in sensor data based on daily device usage will help develop reliable, standardized, inclusive, and practically applicable digital behavioral patterns that may be linked to health outcomes in real-world settings.

Keywords: data quality; decentralized clinical study; digital health; digital signal processing; machine learning; model interpretability; multimodal sensing; smartphone sensors.

Publication types

  • Comparative Study

MeSH terms

  • Accelerometry* / instrumentation
  • Adult
  • Data Accuracy
  • Female
  • Humans
  • Male
  • Smartphone*

Grants and funding