Before you can analyze data, it must be in the right form. Join me on April 25th for a 4-hour webinar that shows how to perform the most commonly used data management tasks in R. We will work through hands-on examples using popular R add-on packages such as plyr, reshape, stringr, lubridate and sqldf. I’ll also be presenting a 3-hour version at the useR! 2014 conference. Here’s a list of the topics covered:
- Transformation basics
- Conditional transformations
- Summarization of columns and rows
- Summarization by group
- Analysis by group
- Sorting data
- Selecting first or last observation per group
- Miscellaneous variable tools (rename, keep, drop)
- Stacking data frames
- Finding and removing duplicate observations
- Merging data frames
- Reshaping data frames
- Character string manipulations
- Date / time manipulations (not in shorter useR! presentation)
- Using SQL within R (not in shorter useR! presentation)
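To give a rough taste of a few of these topics, here is a minimal sketch. It is not taken from the webinar materials: the data frame `scores` and its columns are invented for illustration, and it uses only base R functions rather than the add-on packages named above.

```r
# Hypothetical toy data; the names and values are invented for illustration.
scores <- data.frame(
  id    = c(1, 2, 3, 4, 5),
  group = c("a", "b", "a", "b", "a"),
  score = c(70, 85, 90, 60, 75),
  stringsAsFactors = FALSE
)

# Conditional transformation: flag scores of 75 or more.
scores$passed <- ifelse(scores$score >= 75, "yes", "no")

# Summarization by group: mean score within each group.
group_means <- aggregate(score ~ group, data = scores, FUN = mean)
names(group_means)[2] <- "group_mean"

# Sorting: by group, then by descending score.
sorted <- scores[order(scores$group, -scores$score), ]

# First observation per group (after sorting, the top score in each group).
first_per_group <- sorted[!duplicated(sorted$group), ]

# Stacking data frames, then removing the duplicate rows that stacking created.
stacked <- rbind(scores, scores[1:2, ])
deduped <- stacked[!duplicated(stacked), ]

# Merging: attach each group's mean to the original rows.
merged <- merge(scores, group_means, by = "group")
```

Printing `group_means`, `first_per_group` and `merged` after running the block shows what each step produced. Packages such as plyr or sqldf offer other ways to express the same operations.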
Many examples come from my books, R for SAS and SPSS Users and R for Stata Users. That makes it easy to review later what we did, with full explanations, or to learn more about a particular subject by extending an example that you have already seen.
At the end of the workshop, you will receive a set of practice exercises for you to do on your own time, as well as solutions to the problems. I will be available via email at any time in the future to address these problems or any other topics in my workshops or books. I hope to see you there!