OpenRefine

Using OpenRefine on Saagie

Docker creation

An is available at the following URL: https://hub.docker.com/r/bbonnin/openrefine/

In order to use it on Saagie, you need to create a job in the Docker capsule with the following configuration:

  • Docker image URL: bbonnin/openrefine:3.0-beta
  • External application access
    • Expose the application outside the platform through an HTTP port => Checked
    • The container HTTP port used by OpenRefine is the 3333.
  • You can use the custom URL options if you want.
  • External application authentication: keep private, otherwise anyone could access your work.
  • Internal application access => Unchecked
  • CPU: default value should be enough.
  • RAM: minimum: 1024 MB
  • Disk: as much as you need.

Connection to OpenRefine

Open your browser using the link provided in the Data Fabric or the one you provide in the configuration.

Import/Export

As the current image does not provide an efficient way to backup your projects, it is recommended to manually export your projects.

  • Go to OpenRefine
  • On the left, click on "Open Project" and select the project you want to export
  • In the upper right, cick on Export, then select "Export project"
  • Choose the destination

For importing a project:

  • Go to OpenRefine
  • On the left, click on "Import project"
  • Choose the location of your project and click on "Import Project"