(c) pixabay cco


This course last ran on 31 January 2019. The next course will run in spring/summer 2020 (subject to demand)


UKCEH Wallingford, UKCEH Bangor or UKCEH Lancaster

Please express your interest here so we can fix a venue and date.


professionals £269 (Early Bird, then £ 299)

students £199 (Early Bird, then £229) 

Short Course Description:

This one-day course will give you an introduction to Linux and its ability to process big data files fast.  A short exercise will show the speed and efficiency of Linux tools compared to other data processing tools such as R or Excel.

You will log in to a training account and familiarise yourself with the Linux environment. You will learn to use basic commands such as ls, cd, mv, rm, less, and history.  We will discuss different types of regular expressions as used in Linux commands. You will do practical exercises and we encourage you to bring your own dataset.

You will learn advanced Linux tools such as grep, find, rsync, diff, cut, rename, awk and sed commands.  We will explore bash scripting. The course will end with a final file merging task you can solve in several different ways.  We will invite you to produce your own solution to produce the desired result.

Course Objectives:

  • Familiarise with the Linux environment and its ability to process large amounts of data fast with minimal resources. 
  • Apply knowledge to process and manage data files more efficiently and reliably.

Learning outcome:

You will have greater confidence and skills in using LINUX to prepare data for analysis and reformatting files. This will save you a lot of time in future data analysis


Accommodation is not included in the price. Accommodation guidance will be provided nearer the time.





There are no pre-requisites for the Linux appeal course. Experience of data processing is advantageous.

Course Leader:

Milo Brooks Technical Development and Field Instrumentation Engineer, UK Centre for Ecology & Hydrology.

Milo has a BSc in Mathematics and Computer Science from McGill University.  He has over 25 years of experience in data processing, data management and applications development.  In the past 5 years, he has used Linux tools and scripts to synchronise, process and aggregate UKCEH datasets for optimal performance, reliability and security.  Milo is an open-source technology enthusiast using it for environmental science data processing at work as well as for music, animation, CAD and 3D printing at home.

Target Audience:

  • MSc & PhD students
  • Early career researchers  and other academics
  • Those interested in Big Data and the internet of things

Previous Learners said:

"Thanks very much for the course! I gained  a good understanding on how data can be processed with LINUX."  (A course participant, 31 January 2019)

 Please express your interest here so we can fix a venue and date.

More detail & background:

From its humble origins as a project to put users in control of their own computing, Linux is now everywhere in our daily life.  It has been adapted to more platforms than any other in history.  When we consider smartphones and embedded systems, Linux is the most widely installed operating system in the world.  It is used almost exclusively by web and cloud computing servers.  All 500 of the world's fastest supercomputers run on Linux. It sounds too good to be true that a community developed technology can be so influential, but it is!  Join the millions of users worldwide and take part in the ever-growing open source movement.