==DATASET FOR: EXPLORING THE CONCEPT OF PID LITERACY: USER PERCEPTIONS AND UNDERSTANDING OF PERSISTENT IDENTIFIERS IN SUPPORT OF OPEN SCHOLARLY INFRASTRUCTURE==
==GEORGE MACGREGOR | https://purl.org/g3om4c | 2022-10-14==

This README file documents the nature and structure of the data underpinning the analysis in the following paper (RIS format):

TY - UNPB
T1 - Exploring the concept of PID literacy
T2 - user perceptions and understanding of persistent identifiers in support of open scholarly infrastructure
AU - Macgregor, George
AU - Lancho Barrantes, Barbara S.
AU - Rasmussen Pennington, Diane
PY - 2022/10/28
Y1 - 2022
N2 - Abstract TBC
KW - institutional repositories
KW - scholarly graph
KW - PID graph
KW - persistent identifiers
KW - information retrieval
KW - open scholarly infrastructure
UR - https://doi.org/=================TBC
M3 - Working paper
ER -

==DATA DESCRIPTION==
--------------------
This dataset underpins the analysis contained in the above-noted paper and other forthcoming papers. Data are made available in a series of spreadsheet files (.ods), each corresponding to a substantive section of the accompanying research instrument. Statistical tests were performed using the Real Statistics Data Analysis Tool add-in for MS Excel (https://www.real-statistics.com/). Note that many spreadsheet columns are numbered according to the corresponding task/question number in the research instrument. There are 5 .ods files in total.

==CSE DATA==
------------
Filename: Macgregor-etal-pid-understanding-and-perceptions-computer-efficacy-data.ods

This file contains 4 spreadsheet tabs: CSE data; CSE scores per discipline; CSE Discipline Games Howell; CSE Role Games Howell.

1. CSE data: This tab contains results from the computer self-efficacy (CSE) section of the research instrument. Results for each question and each participant are provided. Measures of central tendency are reported. The discipline and job role of each participant are also provided and summarized.
2. CSE scores per discipline: This tab contains similar data to tab #1 but with different data summarization. Included here are CSE scores by discipline affiliation of the participants (e.g. measures of central tendency).
3. CSE Discipline Games Howell: ANOVA data for CSE scores across discipline groups, with post-hoc comparisons using the Games-Howell procedure.
4. CSE Role Games Howell: ANOVA data for CSE scores across job role groups, with post-hoc comparisons using the Games-Howell procedure.

==PID CHALLENGE DATA==
----------------------
Filename: Macgregor-etal-pid-understanding-and-perceptions-semantic-challenge-data.ods

This file contains 8 spreadsheet tabs.

1. PID screen data (totals & job): Data relating to PID screen challenges, with totals across all participants and totals of participants segmented by job role.
2. PID screen data (discipline): Data relating to PID screen challenges, with totals across all participants by discipline.
3. PID screen data (job): Further summarization of the data presented in tab #1. Includes measures of central tendency for participant performance on PID screen challenges.
4. Welch ANOVA PID discipline: Tests performed between discipline groups based on participant scores in the PID screen challenge.
5. Welch ANOVA PID job: Tests performed between job role groups based on participant scores in the PID screen challenge. Measures of central tendency and additional data summarization are provided. Also includes tests performed between job role groups based on participants' *incorrect* results in the PID screen challenge.
6. PID recognition data: Data pertaining to the six PID recognition challenges. Results from the recognition challenges are summarized by PID type (e.g. DOI, Handle.net, ORCID, etc.) and participant response. Responses are also summarized as percentages of the total number of participants.
7. PID recognition chart: This sheet contains summarized data from tab #6, used to generate a bar chart for inclusion in the research article.
8. PID recognition Kruskal-Wallis: Tests performed between discipline and job role groups based on participants' recognition of PIDs in the PID recognition challenges.

==PID PERCEPTION DATA==
-----------------------
Filename: Macgregor-etal-pid-understanding-and-perceptions-perception-data.ods

This file contains 8 spreadsheet tabs, each relating to the PID perception tests (based on the semantic differential scale approach), the resulting data, and their analysis.

1. Semantic differential - all: This tab contains data for all participants from the PID perception tests. This includes the summarization of the semantic differential data according to the semantic dimensions (Evaluative, Potency, Activity) and by concept, e.g. Scholarly communication, People, Places & Things. Semantic distance using the generalized distance formula is also calculated on this sheet.
2. Semantic differential - physica: This tab contains the same data, calculations, and analysis as tab #1 above but limits data to participants from the *Physical sciences* discipline.
3. Semantic differential - social: This tab contains the same data, calculations, and analysis as tab #1 above but limits data to participants from the *Social sciences* discipline.
4. Semantic differential - life sc: This tab contains the same data, calculations, and analysis as tab #1 above but limits data to participants from the *Life sciences* discipline.
5. Distances between groups: Data relating to Wilcoxon signed-rank tests used to determine whether the semantic distance (D) between specific discipline groupings was significant.
6. Distance PS: Data pertaining to PID perceptions organized by individual participant from the *Physical sciences* discipline only. Contains summarized performance on the semantic dimensions and includes semantic distances between semantic factors.
7. Distance SS: Data are the same as described in tab #6 but relate to *Social sciences* participants only.
8. Distance LS: Data are the same as described in tab #6 but relate to *Life sciences* participants only.

==SEMANTIC DIFFERENTIAL CHART DATA==
------------------------------------
Filename: Macgregor-etal-pid-understanding-and-perceptions-semantic-differential-chart-data.ods

This file contains the data used to generate the semantic measurement charts used in the article, along with the charts themselves.

1. Semantic differential chart ALL: The data and chart in this tab pertain to ALL participants' PID perception measurements by semantic concept (MEAN derived from the PID perception file) and semantic dimension.
2. Semantic differential chart PS: The data and chart in this tab are a subset of data from tab #1 and pertain to *Physical sciences* participants' PID perception measurements by semantic concept (MEAN derived from the PID perception file) and semantic dimension.
3. Semantic differential chart SS: The data and chart in this tab are the same as tab #2 but pertain to *Social sciences*.
4. Semantic differential chart LS: The data and chart in this tab are the same as tab #2 but pertain to *Life sciences*.

==PID REUSE DATA==
------------------
Filename: Macgregor-etal-pid-understanding-and-perceptions-reuse-data.ods

This file contains PID (re)use data, elicited from the final section of the research instrument. There is only one tab (PID use data). Summarization and measures of central tendency for question responses are included for all participants, by discipline, and by job role. Analysis of the data includes the chart used in the article to describe participants' PID creation and reuse.

==LICENSING==
-------------
This work is licensed under the Creative Commons Attribution 4.0 International Public License. See https://creativecommons.org/licenses/by/4.0/legalcode for further details.
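
==EXAMPLE: SEMANTIC DISTANCE CALCULATION==
------------------------------------------
The semantic distance (D) reported in the PID perception file is described above only as being calculated with the generalized distance formula. As an illustration, the following Python sketch assumes the common form of that formula, D = sqrt(sum of squared differences), applied across the three semantic dimensions (Evaluative, Potency, Activity). The function name, variable names, and ratings shown are hypothetical and are not taken from the dataset; the spreadsheets remain the authoritative record of the calculations actually used.

```python
import math

# Hypothetical mean ratings on the three semantic dimensions.
# These values are illustrative only and do NOT come from the dataset.
concept_a = {"Evaluative": 5.2, "Potency": 4.1, "Activity": 3.8}
concept_b = {"Evaluative": 4.6, "Potency": 4.9, "Activity": 3.8}


def semantic_distance(a, b):
    """Return D = sqrt(sum((a_i - b_i)^2)) over the shared dimensions.

    Assumed form of the generalized distance formula; both inputs must
    be rated on exactly the same set of dimensions.
    """
    if a.keys() != b.keys():
        raise ValueError("Both concepts must use the same dimensions")
    return math.sqrt(sum((a[dim] - b[dim]) ** 2 for dim in a))


d = semantic_distance(concept_a, concept_b)
print(f"D = {d:.2f}")
```

A smaller D indicates that two concepts (or two groups' ratings of one concept) sit closer together in the semantic space; a D of zero means identical mean ratings on every dimension.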