Webinar: Introduction to QAMyData ‘health-check’ tool for numeric data
2 December 2019
Online, 15.00 - 16.00 GMT
This webinar is an introduction to the new QAMyData tool for health-checking your numeric data, recently launched in November 2019.
The tool uses automated methods to detect and report on some of the most common problems found in survey or numeric data, such as missingness, duplication, outliers and direct identifiers. The open source tool helps data creators and users quality assess a numeric data file using a comprehensive list of ‘tests’, classified into types: file, metadata, data integrity, and direct identifiers. Popular file formats can be tested, including SPSS, Stata, SAS and CSV. The test configuration feature allows the creation of your own unique Data Quality Profile, that can play a useful role in your ‘FAIR’ data checking.
The webinar will describe the tests that are included in the tool, how to configure these to meet your own quality thresholds, and how to download the software from our Github page. We will also show our teaching exercise using messy data that can help promote data management skills.
The webinar will consist of a 30 minute presentation followed by 20 minutes for questions.
Presenters: Louise Corti, Cristina Magder and Myles Offord
Experience/knowledge required: Some knowledge of survey or numeric data
Target audience: Data publishers/data archivists, users of numeric data, peer reviewers of data, quantitative research lecturers