In this tutorial you will learn to use Google Refine to clean a dataset and create files for analysis. We will perform some cleaning together in class.
Here are the slides from this tutorial
Getting Started
- Install Google Refine. You can install download and install from here:
http://openrefine.org/download.html
You should install the latest version at the top of the page (not any beta version).
- Download the following dataset: UniversityData.csv
References
The documentation for Google Refine / Open Refine is available here.
There are also a set of nice introductory tutorials available on YouTube: Part 1, Part 2, Part 3
Here are helpful pointers to the Open Refine Expression Language
If you have not been in class, follow this tutorial after having watched the videos above: https://web.archive.org/web/20190105063215/http://enipedia.tudelft.nl/wiki/OpenRefine_Tutorial