Beim Erarbeiten neuer Skills und Tools ist der Einstieg nicht immer einfach. In der STATWORX Academy bieten wir daher eine Fülle an Schulungsformaten an, in denen wir Dir den Einsteig in den Bereich Data Science erleichtern wollen. Unser Kollege Jan erzählt in diesem Blogartikel vom „R for Data Science“ Workshop, den er in unserem Frankfurter Office geleitet hat. Erfahre in diesem Artikel, an wen sich der Kurs richtet und welche Inhalte geboten werden.
Benchmarking your code is one thing – another thing is to keep and use the gained knowledge for future projects. In this blog, Jakob presents his collection of benchmarks and creates an easy to use a setup for new ones.
Training random forests on time series is one thing, but tuning them? It’s not like you can just apply cross validation and be done with it. Or can you? This post forms part two our mini-series “Time Series Forecasting with Random Forest”. Find out how you can tune the hyperparameters of the random forest algorithm when dealing with time series data. The answers might surprise you!
rBokeh is an interactive plotting library. Since it functions lack some arguments compared to its Python counterpart, plots are sometimes difficult to customize. I will show how to overcome those issues and drill out the plot objects.
Cross-validation is a widely used technique to assess the generalization performance of a machine learning model. In this blog post I will introduce the basics of cross-validation, provide guidelines to tweak its parameters, and illustrate how to build it from scratch in an efficient way.
Time flies by and we are more people now here at STATWROX. But, did we change our behavior in using emojis or are we still the same? I am revisiting my analysis and will have a look!
Shiny enables its users to quickly create a fixed UI with code. Although simple, this can prove to be quite limiting. Applying the principles of reactivity to the UI part of a ShinyApp is a natural progression from reactive coding as we know it already from Shiny’s server side. This blog entry reviews and discusses two of the most convenient tools for doing so.
Data Science beginners often encounter the same question: which programming language should one learn first? The choice usually falls on one of the two major providers, R or Python. With this blog article, we want to help you with the search for the right programming language for you.
What are driving factors behind the gas price? With freely accessible data we are goging to find out if the brand, the location and more have any impact on the price!