Bring Your Own Data: R-coding for analysing RNA-seq data
Important: for in person courses you need to bring your own laptop.
Target Audience
Basic understanding of R and R-studio is required. If you do not have this yet, consider to take the course “Introduction to R for Life Sciences” first. In addition, the course will be easier to follow with some knowledge of the R package ggplot2. If you do not have this yet, consider taking the course Bring your Own Data: create figures in R using ggplot2.
Course description
During this BYOD course you learn processing RNA-seq (and similar types of) data by applying lessons to your own data. If you do not have your own data, you can use data from colleagues or download a data set from a paper or a database like the EBI expression atlas (www.ebi.ac.uk/gxa). Your data should be a count table that you can load into R, so no raw reads!
This course is aimed at PhD candidates proficient in R programming but with little or no experience in analysing RNA-seq data. The course is also well-suited for PhD candidates who have analysed RNA-seq data before, but seek a deeper understanding of the many possible analyses.
When you register for this course, we expect you to be proficient in basic R. See if the following statements apply to test if this is the case for you:
- You are comfortable loading, exploring and manipulating data in R.
- You need little effort to work with various data types (numeric, character, logical, factor) and various object types (vector, matrix, data frame).
- Making subsets, reordering data, and making simple plots is intuitive.
- You can write an R script that logically processes data to some end result, such as a figure or summarising table.
- With some effort, you can write simple functions and use apply and for-loops to process your data.
- You understand basic statistics such as t-tests, ANOVAs and lineair models (knowledge included in the Introductory to Biostatistics course.
- You can apply skills on the Base R cheatsheet without help: https://posit.co/wp-content/uploads/2022/10/base-r.pdf
Experience in the following topics will help you, but is not required:
- Experience with ggplot2 for making publication-quality figures (knowledge included in the BYOD ggplot2 course).
- Experience fitting linear models to data.
- Rmarkdown and Rstudio: if you do not meet this level of R programming yet, be welcome in the Intro to R course we offer via the PhD Course Centre. Of you do neet meet the level of statistics yet, be welcome to follow the Introductory to Biostatistics course.
During this course you will work on your own laptop with R and Rstudio installed!
Learning objectives
After successfully completing this course, you:
- Understand the characteristics of RNA-seq data in general and your specific dataset in particular.
- Have an overview and understanding of the main steps in any RNA-seq analysis.
- Can design a simple workflow for processing data of various RNA-seq experimental setups.
- Base your analysis decisions on limitations, weaknesses and strong points of various types of analyses.
- Can execute and document these analyses in R(markdown).
- Can make important visualisations to describe your data, support analysis decisions and report statistical outcomes.
In this course, we will make many figures of RNA-seq data. The focus of these figures is a deeper understanding of the analysis and the data, not publication quality. We teach a separate BYOD course for this purpose: BYOD ggplot2. This ggplot2 course is a nice addition to elevate your figures to publication quality, for RNA-seq-related figures, but also for any other data type.
Instructional method
During this course, you will work on your own laptop. A week before the course starts, you will receive an email with instructions on how to install the required software. Classes during the week start with in-class lectures, hands-on exercises and Q&A sessions. As the week progresses, fewer exercises are provided and you will work predominantly on your own data. At the end of the course, you will shortly present and discuss one of the figures you have made during the course to share your progress.
Trainer
You will be trained by one of the professionals of Utrecht Bioinformatics Center, Utrecht University.
Group size
Groups are small to ensure sufficient supervision time for all data-set-specific questions. We allow 10 participants in the course. Depending on demand, the course may be organised more frequently.
Number of credits
1.5 EC
Study load
Next to the in-class lectures, hands-on exercises and Q&A sessions, you will spend the vast majority of your time on analysing your RNA-seq data during the week of the course. In this way you gain as much as possible from the in-class sessions.
Course certificate
You will receive a course certificate after actively participating in all course sessions and passing the Final Assignment.
Cancellation and No-show policy
This course is free for GSLS PhD candidates. However: free of charge does not mean free of responsibility. Once you have signed up for a course, we expect you to attend. For every late cancellation or no-show we have had to disappoint others who would have liked to attend. This is our policy:
- You may cancel free of charge up to 4 weeks before the start of the course. After this date you can only cancel if you have a GSLS PhD candidate to replace you in the course. Send the name and contact information of your replacement to pcc@uu.nl, at least 2 working days before the start of the course;
- We expect that you actively attend the full course, but at least 80%. It is mandatory to attend the first session. If you are absent the first session you cannot follow the remaining of the course;
- Not meeting the above requirements means you will be charged a no-show fee (€ 75). We will send the invoice after the course has ended. We are unable to make any exceptions, unless you have a valid reason (i.e., illness or death in the family 1st/2nd degree or partner). Your supervisor has to send an e-mail to pcc@uu.nl indicating the reason.
Unfortunately we don’t offer this course for participants not part of the GSLS. Our courses tend to be fully booked by GSLS PhD candidates
Entrance fee: This course is free for GSLS PhD candidates.
Location: Utrecht Science Park
Registration for this course opens 2 months before the course starts. You can register via our course portal (see registration link below). After opening, the portal shows how many spots are still available. You can subscribe to the interest list when a course is fully booked or not yet open for registration. When a new edition opens for registration you will receive an e-mail.