The future of data.ox.ac.uk

In the spirit of Alex Bilbie’s post about the future of data.lincoln.ac.uk, and Southampton’s post about the technical background and direction of data.southampton.ac.uk, I thought it’d be a good idea to chime in with where data.ox.ac.uk is going. Continue reading

Posted in Uncategorized | Leave a comment

The Vacancies Dataset

The vacancies dataset contains job listings from the University’s central recruitment site.

Updates and timeliness

We currently check for new vacancies every fifteen minutes.

Modelling and data

The data are represented in RDF, primarily using the vacancies vocabulary. Data are stored across two RDF graphs, one for current vacancies, and another for those that have closed.

We also take a copy of the associated documents, which are stored on source.data.ox.ac.uk. Where possible we pull out the plain text of each document and include it in the RDF. At some point we aim to implement full-text search, which should make it possible to search for phrases within job descriptions.

Querying the data

In addition to using SPARQL, you can retrieve vacancy data using the following methods:

Feeds

It’s possible to get the data as RSS and Atom using URLs of the following formats:

http://data.ox.ac.uk/feeds/vacancies/xxxxxxxx[.format]
This will return all vacancies within the unit with the OxPoints ID of xxxxxxxx. It will exclude any attached to its sub-units, so if you ask for the Social Sciences Division, you won’t get back anything in Classics. You can find the base URL by finding the unit you want on this page. Once you’ve found the base URL you can append the name of a format. We currently support various RDF serializations, RSS and RSS2.0, and Atom.
http://data.ox.ac.uk/feeds/all-vacancies/xxxxxxxx.format
This is the same as above, but includes vacancies advertised as being within the sub-units of the requested unit. So in this case you’d also get vacancies in Classics when you ask for the Social Sciences Division. This means that you can get a list of all vacancies within the University hierarchy from http://data.ox.ac.uk/feeds/all-vacancies/00000000.

In both of these cases you can filter by adding a ?keyword=keyword parameter to filter by a sub-string.

For the RSS and Atom feeds we’ve given the closing date as the publication date, which should help when re-displaying these feeds as part of another website.

Bugs and limitations

At the moment we only pull data from the University’s recruitment site, which excludes collegiate appointments. We’re working to re-integrate work that was done to pull additional vacancies from www.jobs.ac.uk.

Salary information is currently just plain text. In due course we plan to model the University’s salary scales, which we can then link to. Once that is done it’ll be possible to perform range-based SPARQL queries on grade and upper and lower annual remunerations.

Our location parsing isn’t perfect, so we sometimes assign vacancies to the wrong unit. We hope to make this a bit cleverer soon!

Posted in Uncategorized | Tagged , | Leave a comment

About the University of Oxford’s open data service

data.ox.ac.uk is intended to be a repository for the University’s institutional open data. By collecting these data together and making them available in a machine-readable and re-usable way we hope to:

  1. Improve information flows in and around the University, allowing groups and departments to easily ingest and re-present data produced by another part of the University.
  2. Enable other people to use the data in new and innovative ways; to re-express the data to show something previously unknown.

To do this we’re creating a platform for storing and querying these data using standards-based technologies. A lot of the data will be available as RDF and queryable using SPARQL. However, where we can we are producing web-friendly APIs for those who want a low barrier to getting started and who aren’t familiar with RDF and SPARQL. It’s also possible to download the data in bulk for processing and manipulation off-line.

At the moment the service is in beta; it’s not officially supported and there’s no official commitment to keep it running. This may change as we demonstrate its usefulness.

Posted in Uncategorized | Leave a comment