University Libraries

Libraries to offer workshop series on programming language R and RStudio

Series of four workshops will focus on research reproducibility and data management

Credit: Chris Blaska / Penn State. Creative Commons

UNIVERSITY PARK, PA. — Beginning Oct. 15, the Research Informatics and Publishing department at Penn State University Libraries will offer a series of four workshops on research reproducibility and data management in the programming language R and its associated open-source integrated development environment, RStudio. These workshops will introduce participants to implementable practices that support reproducible and open research. More specifically, they will learn how to use R and RStudio to apply data management practices and enable reproducible analysis and documentation workflows.

R allows users to wrangle data sets, conduct statistical analyses, create data visualizations, and develop reproducible and documented analysis workflows. This workshop series will offer hands-on training in fundamental coding skills, data management strategies in R to support research reproducibility, and data visualization. Participants can expect to learn how to wrangle data into an analysis-ready format, use R packages and connections to manage R projects, and create data visualizations using the R package, ggplot2.

The workshops are free and open to Penn State graduate students, postdoctoral scholars, faculty and staff. Beginner knowledge of R is recommended, but no previous knowledge of R is required.

Because the workshops build upon one another, participants are expected to complete each one in sequence. The first workshop, “Introduction to R and RStudio,” is optional but is recommended for those who have never used R or RStudio before.

Participants must have access to a computer with a Mac, Linux or Windows operating system and be able to download R, RStudio and Git applications. Registrants will receive instructions on how to access these applications prior to the start of the workshops.

All workshops will be held virtually via Zoom. Llinks will be distributed via email following registration. Advance registration is required; registration and additional information is provided at the link below.

For additional information, contact Research Informatics and Publishing at repub@psu.edu.

Register for this workshop series here.

Workshop Schedule 

Introduction to R and RStudio — Oct. 15, 1–3 p.m.

This session will introduce R and RStudio, walk through the platform interface, and discuss the utility of using the software for reproducible research practices. More specifically, participants will learn how to set a working directory, load data and packages, and discover how to find resources to support general learning and to answer specific questions.

Data Wrangling in R — Oct. 22, 1–3 p.m.

This session will introduce the use of the package data.table to manage, clean, and transform data into “tidy” format or create new variables in a reproducible manner. Additionally, participants will learn how to handle string and date/time data.

Data Management and Research Reproducibility in R and RStudio — Oct. 29, 1–3 p.m.

This workshop will focus on data management strategies that participants can implement in R and R Studio to develop a reproducible analysis and output workflow to facilitate transparent and reproducible research, as well as support open data sharing.

Data Visualization in R — Nov. 5, 1–3 p.m.

This workshop will provide an overview of how to use the R package, ggplot2, to create meaningful data visualizations.

Last Updated September 24, 2025