History 3816G / Digital Humanities 3902G:

Introduction to Digital History

Tuesdays, 6pm

Room UC-222

Contact me

Devon Elliott

delliot8@uwo.ca

Office Hours: Tuesdays, 4:00 - 5:00pm, SSC 1004 or by appointment

Presentation

Tara
What Was There?
http://www.whatwasthere.com/

Presentation

Connor
123D Catch
http://www.123dapp.com/catch

Comparing and Sampling: Structured Data

Readings:

Comparing and Sampling: Structured Data

Technology:

  • OpenRefine

Structured vs. Unstructured Data

.CSV & .TSV files

CSV = comma-separated values

TSV = tab-separated values

These are basically structured textfiles, with one record on each line and fields of data delimitted by commas or tab spacing.

Often, the first line labels the fields for the records in the document.

Download a dataset

If you already have a .CSV or .TSV file, you can use that for the following examples. If not, we'll download a dataset with over 60,000 UFO sightings.

We'll use Wget to download just the .TSV data file. At the command line, enter:

wget https://raw.githubusercontent.com/johnmyleswhite/ML_for_Hackers/master/01-Introduction/data/ufo/ufo_awesome.tsv

Importing into Google Maps

http://maps.google.ca/

You can also try importing into Google Spreadsheets.

http://drive.google.com/

Intermission

OpenRefine

http://openrefine.org/

Download OpenRefine

http://openrefine.org/download.html

If you're using Yosemite, you might need to add Java.

Have a great week!

See you on April 7.

Contact me at delliot8@uwo.ca or stop by SSC 1004 on Tuesdays, 4:00-5:00. I'm also available before and after class on Tuesdays, or by appointment.