Detailed information about the course

Title

Advanced Data Management & Manipulation Using R

Dates

23 & 30 September 2021

Workshop language is English

Organizer(s)

Dr Pauline Fritsch, Coordinator, Doctoral program in Organismal Biology, UNINE

Dr Marta Bellone, Coordinator CUSO-DPEE

Speakers

Dr. Jan Wunder, Wunder Consulting Wald (ZH)

Dr. Tina Cornioley, University of Zurich

Description

The analysis of large data sets ("big data") is becoming increasingly important in science and elsewhere. In this course you will learn how to use R to manage and manipulate large data sets, i.e. to sort, merge, subset, aggregate and reshape data, and how to apply outlier detection and gap-filling algorithms.

For advanced data manipulation, we are going to use recent developments such as dplyr ("A Grammar of Data Manipulation"), the pipe operator (%>%) for simpler R coding, and data.table for the fast aggregation of large data sets. Furthermore, we will take a closer look at connecting R to databases, running SQL queries and creating new databases from R.

Depending on the progress of the course, there will be scope for participants to work on small projects and/or their own data sets.

Program

Course outline:

  • Data structures
  • Data management (merge, sort, reshape, ...)
  • "The data.table way" (data.table)
  • "The grammar of data manipulation" (dplyr)
  • Tidying up messy data (tidyr, NAs & outliers)
  • Databases (ODB)
  • Reporting (knitr)

A homework assignment must be completed after the end of the course (deadline: TBA).

Location

University of Neuchâtel

Credits

1

Evaluation

Full attendance and active participation. A homework assignment must be completed after the end of the course (deadline: TBA).

Information

Prerequisites

Participants should be familiar with R before attending the workshop or have previously attended an introductory R course.

Bring your own laptop to the workshop with recent versions of R and RStudio installed. Make sure that your laptop connects properly to the University of Neuchâtel or eduroam WLAN.

When?

23 and 30 September 2021, 9:00 - 17:00

Where?

In agreement with the organizers, this workshop will be held either onsite in Neuchâtel or online. You can therefore register without fear of cancellation!

Questions?

Catherine Suarez
[email protected]

Expenses

Reimbursement:

Travel expenses:

PhD students of the DPEE are eligible for reimbursement of travel expenses incurred by train (half-fare card, 2nd class).

The online reimbursement system is now in place, so once the course is over you will be able to request the reimbursement via MyCUSO, without sending the paperwork to me. More information: Reimbursement

Accommodation:

Please contact the coordinator of the doctoral program (ecologie-evolution(at)cuso.ch) BEFORE the beginning of the course. NO reimbursement of accommodation without the agreement of the coordinator of the doctoral program prior to the course. In case of an overnight stay, please post the original tickets and original bills along with the reimbursement form to:

Catherine Suarez
Département de Génétique et évolution - Unité d'anthropologie
Université de Genève
Sciences II, 30 quai Ernest-Ansermet, 1205 Genève

NO reimbursement of meal expenses

Registration

Registration through the DPOB webpage: https://www.unine.ch/dp-biol/home/program/courses2021/data2021.html

Deadline for registration: 7.09.2021. Priority is given to PhD students of the DPEE and DPOB until 29 August 2021. After this date, first come, first served!

8 places are reserved for DPEE participants

Registration Fees:

Free for participants enrolled in the CUSO Ecology & Evolution doctoral program.
Cancellations before the deadline: free
Late cancellations or no-shows: an administrative fee of 50 CHF will be charged
Other participants: please contact the program coordinator at ecologie-evolution(at)cuso.ch

Places

16
