Intro to OpenRefine

University of Washington

Online

Feb 14, 2022

Instructors: Jessica Hymers

General Information

Library Carpentry is made by people working in library- and information-related roles to help you build software and data skills.

Library Carpentry introduces you to the fundamentals of computing and provides you with a platform for further self-directed learning. For more information on what we teach and why, please see our paper "Library Carpentry: software skills training for library professionals".

Who: The course is for people working in library- and information-related roles. You don't need to have any previous knowledge of the tools that will be presented at the workshop.

Where: This training will take place online.

Requirements: Participants must have access to a computer with a Mac, Linux, or Windows operating system (not a tablet, Chromebook, etc.) that they have administrative privileges on. They should have a few specific software packages installed (listed below).

Contact: Please email jessica.hymers@mail.utoronto.ca for more information.

Roles: To learn more about the roles at the workshop (who will be doing what), refer to our Workshop FAQ.


Code of Conduct

Everyone who participates in Carpentries activities is required to conform to the Code of Conduct. This document also outlines how to report an incident if needed.


Surveys

Please be sure to complete these surveys before and after the workshop.

Pre-workshop Survey

Post-workshop Survey


Schedule

00:00 Introduction
03:27 OpenRefine Setup
07:40 Introduction to OpenRefine
09:51 Importing Data to OpenRefine
12:13 Layout of OpenRefine
16:33 Facets
20:52 Introduction to Transformations
22:35 Writing Transformations
27:10 Exporting your work and Goodbye
28:36 END

Setup

To participate in a Library Carpentry workshop, you will need access to software as described below. In addition, you will need an up-to-date web browser.

On the Configuration Problems and Solutions wiki page, we maintain a list of common issues that occur during installation. It is written as a reference for instructors, but it may be useful to you as well.

OpenRefine

OpenRefine is a tool to clean up and organize messy data. Please find instructions to install it and the data used in the lesson in the Set-up Instructions.

To complete this lesson, you will need to download the file doaj-article-sample.csv.
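
If you would like to confirm that the file downloaded intact before the workshop, the following is a minimal, optional sketch in Python. It is not part of the lesson. The helper name `preview_csv`, the Downloads folder location, and the UTF-8 encoding are all assumptions made for illustration; adjust them to match your own computer.

```python
import csv
from pathlib import Path


def preview_csv(path, max_rows=3):
    """Return the header row and up to max_rows data rows from a CSV file.

    Hypothetical helper for illustration; assumes the file is UTF-8 encoded.
    """
    with Path(path).open(newline="", encoding="utf-8") as handle:
        reader = csv.reader(handle)
        header = next(reader, [])  # empty list if the file has no rows
        rows = [row for _, row in zip(range(max_rows), reader)]
    return header, rows


if __name__ == "__main__":
    # Assumed location: change this to wherever your browser saved the file.
    sample = Path.home() / "Downloads" / "doaj-article-sample.csv"
    if sample.exists():
        header, rows = preview_csv(sample)
        print(f"{len(header)} columns:", header)
        for row in rows:
            print(row)
    else:
        print(f"File not found at {sample}; check your download folder.")
```

If the script prints a list of column names followed by a few rows, the file is readable and ready to import into OpenRefine.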