Data Visualisation ‹ Science Craft


Day 1: Principles
Day 2: Applications
Day 3: Interactive
Feedback from Previous Workshops
FAQs

Day 1: Principles of Data Visualisation

Understand How to Use Visuals to Efficiently Explore and Effectively Communicate Data

An Example

Over the course of the first day, participants develop an appreciation for the visual language of quantitative data. Drawing on an understanding of visual perception, colour and data, we explore different ways of representing the same data, not only to explore it for ourselves but also to communicate a meaningful result to our audience.

In this example from a previous participant, sugar concentrations were measured over time in peppermint plants grown under different conditions. Among the many ways in which this seemingly straightforward data-set could be visualised, we uncovered the grammar that most readily communicated a clear result. In this case we can observe that sugar concentrations, in particular sucrose, increase over time in high CO₂ drought conditions.

Participants Learn How To:

Use figures to effectively tell the story of their research.
Choose the most appropriate plot type for their data.
Understand the science of perception and apply design principles to communicate their message clearly.
Use aesthetics to relate sensory attributes (colour, shape, etc.) to abstractions (enrichment, change-over-time).
Distinguish between graphics for publication and for data exploration.
Understand the philosophy behind the Grammar of Graphics.

Our Teaching Approach Emphasises:

The participants’ own data – Participants submit their own figures or tables to be critiqued by the instructors.
Relevant examples – Representations of data from various sources help participants to identify the challenges they will face in their own work.
Effective use of figures – Figures are taught as a way to maximise the impact of a research article, reinforcing the written word.

Day 2: Programming Reproducible Data Visualisations

Learn Practical and Flexible Commands for Generating Meaningful Publication-quality Graphics using R

An Example

The second day begins with a hands-on tutorial in ggplot2, the R package that implements the Grammar of Graphics plotting concept introduced on the first day. All major grammatical elements are discussed and demonstrated using a built-in data-set.
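
As a rough illustration of what a layered ggplot2 plot can look like, here is a minimal sketch. It is not taken from the workshop material: the choice of the built-in iris data-set, and of every layer, label and colour palette below, is our assumption for demonstration purposes only.

```r
library(ggplot2)

# Built-in data-set: measurements of 150 iris flowers (illustrative choice).
p <- ggplot(iris, aes(x = Sepal.Length, y = Sepal.Width, colour = Species)) +
  # Geometry: one semi-transparent point per observation.
  geom_point(alpha = 0.6) +
  # Statistics: a straight-line fit for each species, without a confidence band.
  geom_smooth(method = "lm", formula = y ~ x, se = FALSE) +
  # Facets: one panel per species.
  facet_wrap(~ Species) +
  # Scale: a qualitative ColorBrewer palette for the colour aesthetic.
  scale_colour_brewer(palette = "Dark2") +
  # Labels and theme: non-data elements of the plot.
  labs(x = "Sepal length (cm)", y = "Sepal width (cm)") +
  theme_classic()
```

Calling print(p) draws the plot, and ggsave("figure.pdf", p, width = 6, height = 4) writes it to a file (the file name and dimensions are placeholders). Each line added with + corresponds to one grammatical element, so any of them can be swapped or removed without touching the rest.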

By the end of the day students should have developed a visual solution for a data-set they have brought to class.

Participants Learn How To:

Produce publication-quality figures directly in R.
Produce interactive visualisations for data exploration.
Generate non-standard plots, e.g. mosaic plots, Venn diagrams and heat maps.

Our Teaching Approach Emphasises:

Practical code – Participants receive “cookbook” style code that they can immediately use or modify.
The Power of R – Participants learn how to take advantage of R’s functional and flexible capabilities, e.g. in generating non-standard plots.
Learning-by-doing – Key concepts are reinforced using hands-on practical exercises.

Key Features:

Instructor is a scientist with over 5 years of experience using R to prepare figures for scientific manuscripts.
The easy-to-use reference book includes ready-to-use code and techniques that can be quickly implemented.

Day 3: Building Interactive Interfaces

Develop Tools to Build Interactive Interfaces for Dynamic Data Exploration

Description

Day 3 focuses on making interactive plots that can be shared with your colleagues or published on the web. We will explore interactivity in two ways.

First, we will learn about making interfaces that provide access to your data. This includes all the familiar interface features such as pull-down menus, check-boxes, radio buttons, etc., but also upload and download features to use the same script on different data sets or save the visualisations for publication.

An Example

In this example from a previous workshop, the student’s data-set consists of 4200 relative ΔC_T observations (a proxy for gene expression). These observations were contained in 279 unique combinations of five other variables: the experimental CO₂ concentration, the number of days after hatching, the individual gene, the gene group and the tank from which the sample came.

All these elements come together in an interactive interface, which also provides the ability to choose the plot geometry. Different plot geometries reveal different trends in the data-set. When the dot plot geometry is chosen, activating a check-box plots the average relative ΔC_T per tank.
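
An interface of this kind can be sketched in shiny along the following lines. This is a simplified illustration and not the student's app: it uses the built-in iris data-set in place of the gene expression data, and the input names (yvar, geom, show_mean), the helper make_plot() and the download file name are all hypothetical. File upload is left out for brevity.

```r
library(shiny)
library(ggplot2)

# Build the plot for a given measurement, geometry and check-box state.
make_plot <- function(yvar, geom, show_mean) {
  p <- ggplot(iris, aes(x = Species, y = .data[[yvar]])) + labs(y = yvar)
  if (geom == "dot") {
    p <- p + geom_jitter(width = 0.1, height = 0, alpha = 0.6)
    # The group mean is only overlaid on the dot plot geometry.
    if (show_mean) {
      p <- p + stat_summary(fun = mean, geom = "point", colour = "red", size = 3)
    }
  } else {
    p <- p + geom_boxplot()
  }
  p
}

ui <- fluidPage(
  sidebarLayout(
    sidebarPanel(
      # Pull-down menu: which variable to plot.
      selectInput("yvar", "Measurement", choices = names(iris)[1:4]),
      # Radio buttons: which plot geometry to use.
      radioButtons("geom", "Plot geometry",
                   choices = c("Dot plot" = "dot", "Box plot" = "box")),
      # Check-box: overlay the mean per group.
      checkboxInput("show_mean", "Show mean per species", value = FALSE),
      downloadButton("download", "Download plot (PDF)")
    ),
    mainPanel(plotOutput("plot"))
  )
)

server <- function(input, output) {
  # Rebuilt automatically whenever one of the inputs changes.
  current_plot <- reactive(make_plot(input$yvar, input$geom, input$show_mean))
  output$plot <- renderPlot(current_plot())
  # Save the plot currently on screen for use in a publication.
  output$download <- downloadHandler(
    filename = function() "plot.pdf",
    content = function(file) {
      ggsave(file, plot = current_plot(), device = "pdf", width = 6, height = 4)
    }
  )
}

app <- shinyApp(ui, server)
```

Running runApp(app) in an interactive R session starts the interface in a browser. Keeping the plotting logic in a plain function, separate from the server code, means the same function serves both the on-screen plot and the downloaded file.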

Second, we will explore the use of tooltips and brushing to interact with the plot itself. One example of tooltips is an interactive binomial distribution app: when the user hovers over a bar, the point probability is given. Tooltips and brushing can also be used with scatter plots and linked to tables, providing information on a single observation or many data points.
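
The value reported by such a tooltip is the binomial point probability P(X = k). As a minimal sketch in base R, assuming illustrative values for the number of trials and the success probability (they are not taken from the app), the text for each bar could be prepared like this:

```r
n <- 10   # number of trials (illustrative value)
p <- 0.3  # probability of success per trial (illustrative value)

k <- 0:n                                   # every possible number of successes
point_prob <- dbinom(k, size = n, prob = p) # P(X = k) for each bar

# One tooltip string per bar of the distribution.
tooltip_text <- sprintf("P(X = %d) = %.4f", k, point_prob)

tooltip_text[k == 3]
# "P(X = 3) = 0.2668"
```

How the string is attached to the hover event depends on the interactive framework used, so that part is deliberately left out here.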

Feedback


Understanding & Confidence

The Data Visualisation workshop enhances participants’ understanding of the importance of visually communicating their research results. In addition, students leave the workshop confident that they can create impactful, well-designed figures using the R statistical environment.

Survey Responses

The high overall score of the Data Visualisation workshop rests on the unique features of our training approach. Students single out the instructor’s personal knowledge and one-on-one attention for praise. This approach helps them put the theory of Data Visualisation into practice when making their own figures. The tailor-made workshop manual is prized as a rich source of data visualisation ideas that inspires students to choose and implement optimal visualisations for their data.

Audience & Practicalities

Who is the target audience for this workshop?

Essentially anyone with some knowledge of R and some form of quantitative data that they need to visualise. This includes in-house data scientists responsible for generating reports for colleagues and supervisors, scientists preparing their next publication or presentation, and journalists who want to add another tool to their data journalism kit – anyone who appreciates that visualisation is an essential component of the data analysis and communication process.

Typically, participants in previous workshops have been graduate students in the life sciences, but we also invite people from outside science to participate.

I have no experience using R. Is this workshop still appropriate for me?

In the past, participants have commented that we place too much emphasis on prior knowledge of R, to the point that some participants decided not to attend. We err on the side of caution.

Data handling is not covered in this workshop. In short, if you have no experience using R, you may find this workshop very challenging. This is particularly true if you have a poor understanding of your data and how to handle it. For this situation we refer you to the Data Analysis Workshop, which provides an introduction to programming in R.

In our experience, if you have already worked with other programming/scripting languages (e.g. Python or MATLAB), you will have a much easier time, even if you only know the basics of R.

Another consideration is the format of your data, i.e. how your variables and observations are arranged. Rearranging your data manually is not only time-consuming but also error-prone. To help with this, we will briefly cover reshaping data. If you have a difficult time with R, this will again be a challenging part of the workshop for you. The less comfortable you are with R, the cleaner your data should be.
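
To give an idea of what reshaping involves, here is a minimal sketch that converts a "wide" table (one column per sugar) into a "long" one (one row per measurement), which is the arrangement ggplot2 generally expects. It uses base R's reshape() function; the workshop itself may use a different package. The column names and all values below are made up for illustration.

```r
# Wide format: one row per day, one column per sugar (made-up values).
wide <- data.frame(
  day     = c(1, 2, 3),
  sucrose = c(1.2, 1.8, 2.5),
  glucose = c(0.9, 1.0, 1.1)
)

# Long format: one row per day and sugar, with a single value column.
long <- reshape(
  wide,
  direction = "long",
  varying   = c("sucrose", "glucose"),  # columns to stack
  v.names   = "concentration",          # name of the stacked value column
  timevar   = "sugar",                  # new column identifying the source column
  times     = c("sucrose", "glucose"),  # labels stored in that new column
  idvar     = "day"                     # column identifying each original row
)
rownames(long) <- NULL  # drop the composite row names that reshape() creates
```

The result has six rows and the columns day, sugar and concentration, so the sugar type can be mapped directly to an aesthetic such as colour.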

I don’t have any data. Will you provide data sets for me to work on?

In short: no. Although there are plenty of built-in data-sets in R, which we make use of in the workshop, we emphasise that working on your own data is the most beneficial use of your time.

Who is the instructor?

The workshop will be led by Dr. Rick Scavetta, a biologist and co-founder of Science Craft. Rick has over three years of experience developing data visualisation solutions and teaching on-site Data Visualisation workshops for scientists of various disciplines. He is frequently hosted by graduate schools associated with Max Planck Institutes and Clusters of Excellence across Germany.

Participants consistently remark on Rick’s professional yet approachable presence in workshops. The atmosphere is relaxed, fun and participatory – everyone is encouraged to contribute their opinions and experiences – which fosters a positive learning environment.

Rick has authored the reference book used in the workshop and also offers his services as a “visual editor” for scientific publications. Last year, Rick spoke at the re:publica conference in Berlin and was invited to present concepts in data visualisation for Quarks & Co., a popular science program on WDR in Germany.

Will meals and accommodation be provided?

Coffee, snacks and lunch for each day are included in the workshop price. No accommodation arrangements are provided.

Will computers be provided?

No, participants are expected to bring their own computers and data.

How many participants will there be? How many hours of instruction are there per day? What reference material do I get?

The workshop is limited to 12 participants. Each 8-hour day consists of 6.5 hours of instruction and exercise time plus coffee breaks (2 × 15 min) and a lunch break (1 hour). In addition, each student receives a reference book, written by the instructor, containing additional material.

What are the costs?

Early bird registration (until 12.07.2015) is 500 EUR. Regular registration (13.07.2015 – 23.08.2015) is 550 EUR, and late registration (24.08.2015 – 01.09.2015) is 600 EUR.

How can I register?

You can apply for participation via Betahaus at this link.

Technical

I would like to make a very specific type of plot. Is this possible?

We will work with you to develop an appropriate visualisation solution. So far, most participants have been able to use the introduced packages to handle their data and produce appropriate and meaningful plots. If there are specific plot types you are keen on making, e.g. triangle plots, chord diagrams, Sankey diagrams, networks, etc., it would be helpful if you inform us in advance of the workshop. Special packages or data classes may be required.

What about HTML widgets? I heard that we can now use JavaScript data visualisations in R.

HTML widgets is an exciting and growing, but still relatively new, development in R. This will not be a focus of the workshop, but depending on the interest and R proficiency of the participants, we may explore some of these plot types.

What R packages are you going to focus on?

For the second day, we’ll rely mostly on ggplot2 and associated packages such as RColorBrewer, gridExtra, GGally and ggthemes. For the third day, we’ll move on to rmarkdown and shiny.

Why don’t we just use D3 to make interactive plots?

D3 is wonderful – but this workshop focuses on using R. R has gained immensely in popularity in recent years – both within and outside academia – because it is remarkably flexible and sits at the crossroads of powerful statistics, data analysis and data visualisation. For scientists already accustomed to using R, there is no need to move to another language to produce either publication-quality figures or interactive graphics.

We acknowledge the limitations of data visualisation in R – there is no one tool for all jobs. If there is a D3 plot type that you are keen on producing, this may not be the workshop for you.

Assistance

Is there any financial support available?

Unfortunately, we are not able to offer subsidised or reduced registration fees.

I need to produce a complex plot, but I’m not ready/interested/able to attend this workshop – what should I do?

If you require immediate results, we offer a Visual Editing service. You can submit enquiries directly to rick.scavetta@science-craft.com.

Our goal is to offer this as a recurring in-house workshop in Berlin. You may want to attend a future workshop.

If you are associated with a research institute or graduate school, you may consider organising a Data Analysis workshop, which will give you the necessary background. You can follow this up with a Data Visualisation workshop at a later date.