Data science as knowledge creation a framework for synergies between data analysts and domain professionals

van der Voort, Haiko and van Bulderen, Sabine and Cunningham, Scott and Janssen, Marijn (2021) Data science as knowledge creation a framework for synergies between data analysts and domain professionals. Technological Forecasting and Social Change, 173. 121160. ISSN 0040-1625 (https://doi.org/10.1016/j.techfore.2021.121160)

[thumbnail of van-der-Voort-etal-TFSC-2021-Data-science-as-knowledge-creation-a-framework-for-synergies-between-data-analysts]
Preview
Text. Filename: van_der_Voort_etal_TFSC_2021_Data_science_as_knowledge_creation_a_framework_for_synergies_between_data_analysts.pdf
Final Published Version
License: Creative Commons Attribution 4.0 logo

Download (2MB)| Preview

Abstract

The road from data generation to data use is commonly approached as a data-driven, functional process in which domain expertise is integrated as an afterthought. In this contribution we complement this functional view with an institutional view, that takes data analysis and domain professionalism as complementary (yet fallible) knowledge sources. We developed a framework that identifies and amplifies synergies between data analysts and domain professionals instead of taking one of them (i.e. data analytics) at the centre of the analytical process. The framework combines the often-cited CRISP-DM framework with a knowledge creation framework. The resulting framework is used in a data science project at a Dutch inspectorate that seeks to use data for risk-based inspection. The findings show first support of our framework. They also show that whereas more complex models have a higher predictive power, simpler models are sometimes preferred as they have the potential to create more synergies between inspectors and data analyst. Another issue driven by the integrated framework is about who of the involved actors should own the predictive model: data analysts or inspectors.