As data scientists, getting our hands on the data we need is often the most challenging part of a project. In practice, we tend to make life hard on ourselves because we don’t use the best tools for the job. Well no longer! Read on to learn how can you can harness Airflow to orchestrate your own ETL processes like a pro!
Have a look at what my team and I worked on during the Permafrost Hackathon in Zurich. The goal was to detect movements from multitemporal images. Since the images didn’t have any labels, we used unsupervised learning methods. Check it out, yo!
For all those, who are struggling with the (kind of weird) Johns Hopkins University COVID-19 case data CSV files, we’ve created a free API that makes it easy to integrate the latest worldwide COVID-19 data into your application.
This blog is a hands-on experience in Dash, presenting core components, how to display figures with callbacks, supplying you with a working web application to play with, and the resources to build your own. Dash is a powerful tool for Python developers. Developed by the team behind Plotly, Dash is an open-source framework built on top of Flask, Plotly.js, and React.js.
As a REST API, Livy provides Spark interaction without any need of a Spark configuration on your client. Once being able to communicate with the API Spark code can be submitted from everywhere.
Besides working hard to provide our clients with cutting-edge Machine Learning solutions, we are also big fans of all things culinary here at STATWORX. But can we apply some of those algorithms to make us better cooks? This blog article explores the unlikely union of Data Science and baking!
Want to obtain a specific dataset from a website which does not have an API? In this post, I explain how to do this by scraping data using Python, how you determine whether it is allowed to scrape a specific page and more.
Could you #BeatTheAI? We let deep learning have a go at Super Mario’s first level and compared it to human players. Here we explain how we did it!
Data Science Einsteiger stehen immer wieder vor der gleichen Frage: Welche Programmiersprache sollte man als Erstes lernen? Die Wahl fällt meistens auf eine der beiden großen Anbieter, R oder Python. Mit diesem Blogartikel wollen wir bei der Suche nach der geeigneten Programmiersprache helfen.
Our latest tool development at STATWORX: random boost, an algorithm twice as fast as gradient boosting, with comparable prediction performance.