R 4.0

R 4.0 is out Friday, April 24 (American time), so Saturday for me.
It seems I will have to install all my packages for R again. To find out what packages you have, run this R code:

ip <-[,c(1,3:4)])
rownames(ip) <- NULL
ip <- ip[$Priority),1:2,drop=FALSE]
print(ip, row.names=FALSE)

To uninstall R:
1. Delete the R app in the applications folder and empty the trash.
2. sudo rm -rf /Library/Frameworks/R.framework

Now, install R 4.0

Change Log

Convert PDF pages to JPEGs using R

I often get PDFs which have interesting images in them, but the problem is how to extract them?
This R Code will find every PDF in the current folder and covert each page to a 200 dpi JPEG


# Uses the fs library to list all files ending in PDF in the current directory and store them in file_list

file_list <- dir_ls(glob = "*.pdf")

# The eqivualent of a FOR loop, it iterates through each element of file_list and converts each PDF page to a 200dpi jpg using the pdftools library
lapply(file_list, FUN = function(files) {
  pdf_convert(files, format = "jpeg",dpi = 200)



Web Links for March 2020



Visual Studio Code

Raspberry Pi

Compile/install Python 3.8 on Raspberry Pi | Michael Hirsch, Ph.D.
Raspberry Pi 4 Bootloader Firmware Updating / Recovery Guide – Home Assistant
Turns your Raspberry Pi (or another device) into the ultimate home automation hub powered by Home Assistant
Dual Fan Aluminium Heatsink Case for Raspberry Pi 4 Black Australia
Raspberry Pi 4 USB Boot Config Guide for SSD / Flash Drives
GitHub – log2ram: ramlog like for systemd (Put log into a ram folder)
Log2Ram: Extending SD Card Lifetime for Raspberry Pi


JaYoe World Tour Homepage | Follow Matt Cycling Around The World!

Purchased “The Art of Statistics” from Amazon for $30

Statistics has played a leading role in our scientific understanding of the world for centuries, yet we are all familiar with the way statistical claims can be sensationalised, particularly in the media. In the age of big data, as data science becomes established as a discipline, a basic grasp of statistical literacy is more important than ever.

In The Art of Statistics, David Spiegelhalter guides the reader through the essential principles we need in order to derive knowledge from data. Drawing on real world problems to introduce conceptual issues, he shows us how statistics can help us determine the luckiest passenger on the Titanic, whether serial killer Harold Shipman could have been caught earlier, and if screening for ovarian cancer is beneficial.

How many trees are there on the planet? Do busier hospitals have higher survival rates? Why do old men have big ears? Spiegelhalter reveals the answers to these and many other questions – questions that can only be addressed using statistical science.
The Art of Statistics: Code, Data, Errata and Additions | ArtofStatistics


I built something similar — a bit more on the #NLP

from Twitter


Accelerate your plots with ggforce

from Twitter

Publication quality figures with ggplot2

from Twitter

rstats monsters illustrations

from Twitter


Started studying a book I purchased in February Data Visualization: A Practical Introduction

The book provides students and researchers a hands-on introduction to the principles and practice of data visualization. It explains what makes some graphs succeed while others fail, how to make high-quality figures from data using powerful and reproducible methods, and how to think about data visualization in an honest and effective way.

 Data Visualization builds the reader’s expertise in ggplot2, a versatile visualization library for the R programming language. Through a series of worked examples, this accessible primer then demonstrates how to create plots piece by piece, beginning with summaries of single variables and moving on to more complex graphics. Topics include plotting continuous and categorical variables; layering information on graphics; producing effective “small multiple plots; grouping, summarizing, and transforming data for plotting; creating maps; working with the output of statistical models; and refining plots to make them more comprehensible.

Web Excursions for March 2019

R for Blogging

The fastest cyclists of Europe live in …
Analyzing STRAVA data to find out which city has the faster cyclists with R and R-shiny

The Best Free Books for Learning Data Science

Lecturer who uses Academic hugo theme for her website on github

Article written in Blogdown:

Article written in Radix:

Radix is based on the Distill web framework, which was originally created for use in the Distill Machine Learning Journal. Radix combines the technical authoring features of Distill with R Markdown.


Started learning R

I am studying data science. I have chosen to learn R first (before Python) because of its excellent visualisation capabilities.
I have just finished the first unit
– Introduction to the Tidyverse Course