Tidyng human-readable data by inserting suitable separators.
R post archive
Cleaning data from the US Financial Disclosure Reports featured in the A.B. Tran and H. Wickham live data challenge.
Animated explanation of dplyr::top_n() and code to create the gif.
Using gganimate to animate the data-munging process.
A package to handle awkwardky-structured data with multiple header rows or values broken across multiple rows.
Wrangling grouped data with broken values and NA/empty rows per variable per group.
Extracting and wrangling data encoded as text formatting in multi-sheet Excel files.
Plotting how popularity rankings for dog breeds change through time.
Using tidy evaluation to fix values broken up across rows.
Omitting items from legends in ggplot2.
Manipulate and visualize data describing dog breeds and how they are related.
Split a table by a grouping variable, apply functions to each element, and export to separate files.
Downloading, comparing, and visualizing Twitter ratings for dogs and cats.
Wrangle duplicated variable names, weird header rows, and footnotes.
Plotting density distributions with the underlying data as a point swarm.
An R function to put interspersed header rows into their own column.
Wrangling commonly-used data formats. Part 2 - Compound values.
Wrangling commonly-used data formats. Part 1 - GenBank accession tables.
Recoding binary dummy variables into factors. Includes tidyverse and base R approaches.
Working with data describing dog breeds and how they are related.
Extracting dog names from tweets by WeRateDogs.
Text analysis applied to mammalian common names.
Using web scraping, raster objects and gridExtra to add images to your plots.
My take on the SkepticalScience global warming figure using ggplot, includes R code and data.
Extracting edge lengths from R phylo objects.
Matching taxonomic names; joint post with Manuela González-Suárez.
Using R and dplyr to extract minimum or maximum (or both) variable values within groups.
Making diverging stacked bar plots in R using ggplot.
Three ways to draw polygons around groups of points using ggplot.
Measuring spatial overlap with point data.
Emoji flags, international research, and tweet analysis in R.
Manipulating variables and content in biodiversity datasets using base R and dplyr.
Visualizing grouped values using labeled side-by-side bars in ggplot2.
Summary and visualization of research effort for recently described species of terrestrial mammals. Includes R code and data.
Mapping spatial patterns in conservation research using R and ICCB 2011 presentation data.
Using R to fetch elevation for lat/long data through an API.
How long is the average section in a research article? Includes code for downloading and plotting data for five journals.
Example code for visualizing binary trees with dichotomous responses in R, focused on extinction risk modeling.
Summary of research presented at scientific meetings of the Mexican Mammalogy Society (AMMAC) 1991-2012.
Modeling IUCN Red List data as an ordered response variable. A reproducible #rstats example for terrestrial carnivores.