Building the European Social Innovation Database with natural language processing and machine learning
Gök, Abdullah and Antai, Roseline and Milošević, Nikola and Al-Nabki, Wesam (2022) Building the European Social Innovation Database with natural language processing and machine learning. Scientific Data, 9 (1). 697. ISSN 2052-4463 (https://doi.org/10.1038/s41597-022-01818-0)
Filename: Gok_etal_SD_2022_Building_the_European_Social_Innovation_Database_with_natural_language_processing.pdf (Final Published Version, 1MB)
Abstract
Social innovation is widely defined as technological and non-technological new products, services or models that simultaneously meet social needs and create new social relationships or collaborations. Despite significant interest in the concept, the lack of reliable and comprehensive data is a barrier for social science research. We created the European Social Innovation Database (ESID) to address this gap. ESID is based on the idea of large-scale collection of unstructured website text to classify and characterise social innovation projects from around the world. We use advanced machine learning techniques to extract features such as social innovation dimensions, project locations, summaries, and topics, among others. Our models achieve F1 scores as high as 0.90. ESID currently includes 11,468 projects from 159 countries. ESID data is freely available and is also presented in a web-based app. Our future workplan includes expansion (i.e., increasing the number of projects), extension (i.e., adding new variables) and dynamic retrieval (i.e., retrieving and extracting information at regular intervals).
ORCID iDs
Gök, Abdullah ORCID: https://orcid.org/0000-0002-9378-3336; Antai, Roseline ORCID: https://orcid.org/0000-0001-5301-421X; Milošević, Nikola; Al-Nabki, Wesam
Item type: Article
ID code: 83182
Dates: 12 November 2022 (Published); 31 October 2022 (Accepted); 21 April 2022 (Submitted)
Subjects: Science > Mathematics > Electronic computers. Computer science; Social Sciences > Economic Theory > Income. Factor shares > Entrepreneurship. Risk and uncertainty
Department: Strathclyde Business School > Hunter Centre for Entrepreneurship, Strategy and Innovation
Depositing user: Pure Administrator
Date deposited: 14 Nov 2022 14:20
Last modified: 11 Nov 2024 13:41
URI: https://strathprints.strath.ac.uk/id/eprint/83182