A gap remains between developing risk prediction models and deploying models to support real-world decision making, especially in high-stakes situations. Human-experts' reasoning abilities remain critical in identifying potential improvements and ensuring safety. We propose a thick data analytics (TDA) framework for eliciting and combining expert-human insight into the evaluation of models. The insight is threefold: (1) statistical methods are limited to using joint distributions of observable quantities for predictions but often there is more information available in a real-world than what is usable for algorithms, (2) domain experts can access more information (e.g., patient files) than an algorithm and bring additional knowledge into their assessments through leveraging insights and experiences, and (3) experts can re-frame and re-evaluate prediction problems to suit real-world situations. Here, we revisit an example of predicting temporal risk for intensive care admission within 24 hours of hospitalization. We propose a sampling procedure for identifying informative cases for deeper inspection. Expert feedback is used to understand sources of information to improve model development and deployment. We recommend model assessment based on objective evaluation metrics derived from subjective evaluations of the problem formulation. TDA insights facilitate iterative model development towards safer, actionable, and acceptable risk predictions.
Keywords: Algorithmic Audit; Machine Learning; Mixed Methods; Thick Description.