Learning Club 16: Genetic Algorithms

Some time ago I published a blog post titled “Know your data structures!”. In that post I explained how I improved the running time of a genetic algorithm. I promised to go into more detail about other noteworthy things in the code in a separate article, since not everything was straightforward when …

Know your data structures!

Just a few days ago I stated the following on Twitter: “Just reduced the runtime of an algorithm from 9 hours to 3 min. by using a different data structure… Know you data structures 🙂 #rstats” (Verena Haunschmid, @ExpectAPatronum, May 1, 2017). Since my tweet has been liked and shared a lot, I thought …

Presentation “R for Data Science”

Some weeks ago I gave a presentation at my workplace about “R for data science” that I’d like to share with you. I wrote the slides in R and rmarkdown and uploaded them to rpubs.com. I chose to use rmarkdown for my slides, although we have great company PowerPoint templates, because I wanted to …

Learning Club 05-07: Starting to love rmarkdown (Naive Bayes, Clustering, Linear Regression)

I remember that when I took an R course at university I was really not a fan of rmarkdown and knitr. But since I started participating in a Learning Club, where people are encouraged to document and present their code, data and results, I have come to love it. Prior to that I had always documented my assignments at the university either …

I was guest at the Becoming a Data Scientist Podcast!

Almost 2 weeks ago the Becoming a Data Scientist podcast had 4 special interviews, each of them with members of the Learning Club, including me! I was super excited when Renee asked me some weeks ago if I wanted to participate, and I was a little bit nervous during the interview. But I think …

Use rvest to scrape NFL weather data

If you are following my progress in the Data Science Learning Club, you might know that I am using NFL data for the tasks. For predicting sports events, I think it is important to have statistics not only about the players, teams and previous games but also about the weather. From when I was a …

Learning Club 01: Find and explore a dataset

The first activity of the data science learning club I am participating in is to find and explore a dataset. I already described the data I found and will use in my previous post. You can follow all my learning club related activities here. The tasks of this activity are (quoted from the thread above): …

Learning Club 00.b: Setup your development environment (Get started with python package nfldb)

So, just a few days ago I posted Learning Club 00: Set up your development environment (Getting started with R). There I made a mistake and decided to use R without thinking about the data set I would use. I am still happy I wrote the post because it can give all the R users …

Learning Club 00: Set up your development environment (Getting started with R)

A few weeks ago I became aware of a plan by Renee (owner of the blog Becoming a Data Scientist) to start a data science learning club, and I thought it was a cool idea. In the learning club she will post activities, and the first one was about setting up your development environment: Activity 00: Set …