Resources for Business Analytics in R

Jose M Sallan 2024-01-29 3 min read

In this post I present some resources for business analytics in R. Most of them are books that can be consulted freely online, and also buy the printed edition.

Tabular data manipulation

This work explains the functionalities of tidyverse, it works quite well as a reference book. I also attach the Spanish translation for Spanish-speaking users.

The data.table library covers functionality similar to dplyr and tidyr. It has a more complex syntax, but is more versatile and can handle larger tables.

Graphics

Two resources dedicated exclusively to exploiting the functionalities of ggplot. They can be useful for creating graphs to disseminate results within the organization or for publications.

Book similar to the previous two, which also presents the graphical utilities of base R.

Book on how to design graphics and presentations, based on the premise that in any presentation we are telling a story.

Text

R also includes functionalities for analyzing text. This is an introduction to exploratory text analysis within the tidyverse framework.

Maps

A reference for those interested in examining geographic information, for example by examining business locations in a city. This text is a comprehensive introduction to spatial analysis with R.

Supervised learning and machine learning

Until 2019, caret was the standard for R supervised learning workflows, and many users rely on this framework today. The first two books present how to use caret. Later, the author of caret joined the team that develops tidyverse at Posit to develop tidymodels. This set of packages is presented in the third book of this listing.

Publication

In addition to providing analysis tools, R and RStudio provide ways to present results in a reproducible way.

A system to generate reproducible documents, presentations and dashboards.

An open-access scientific and technical publishing system to create all types of documents: Jupyter notebooks, web pages, books or blogs.

An interactive website production system for data analysis.

Another resources

Since 2018, the Tidy Tuesday community has presented a dataset to work with throughout the week. The idea is to have data to practice and share the results with the community.

Julia Silge’s blog presents practical R tutorials for exploratory data analysis and machine learning.

When possible, I publish posts here mainly about R and data analysis. This is a work in progress, which among other things lacks a post index and a proper name. I will be grateful if you can ask me questions and problems in R to create new posts.