Coordinate systems in ggplot2: easily overlooked and rather underrated

Lea Waniek Blog, Data Science, Statistics

All plots have coordinate systems. Perhaps because they are such an integral element of plots, they are easily overlooked. However, in ggplot2, there are several very useful options to customize the coordinate systems of plots, which we will not overlook but explore in this blog post. Since it is spring, we will use a random subset of the famous iris …

About Risks and Side-Effects… Consult your Purrr-Macist

David Schlepps Blog, Data Science, Statistics

Capture errors, warnings and messages, but keep your list operations going. In a recent post about text mining, I discussed some solutions to webscraping the contents of our STATWORX blog using the purrr package. However, while preparing the next episode of my series on text mining, I remembered a little gimmick that I found quite helpful along the way. Thus, …

Testing R Packages

Markus Berroth Blog

The previous blog post showed why testing is worthwhile and what the workflow for individual scripts looks like with the testthat package. This post covers integrating unit testing into your own packages. Set-up: the easiest way to set up unit testing with the testthat package is to run devtools::use_testthat(). This does several things: It creates the directory tests/testthat. It adds testthat …

pandas vs. data.table – A study of data-frames – Part 2

Tobias Krabel Blog, Data Science

The story continues. As Christian and I have already mentioned in part 1 of this simulation study series, pandas and data.table have become the most widely used packages for data manipulation in Python and R, respectively (in R, of course, the dplyr package also deserves a mention). Furthermore, at STATWORX we have experts in both domains, and besides having …