Accessing your Fitbit data

I am a data junkie and bought my Fitbit Charge HR mainly to collect and analyse data about myself, so I looked for ways to download the data to my computer. For most people, the stats overview in the app and in the online dashboard will be sufficient, but some …

Learning Club 05-07: Starting to love rmarkdown (Naive Bayes, Clustering, Linear Regression)

When I took an R course at university, I was really not a fan of rmarkdown and knitr. But since I joined a Learning Club, where people are encouraged to document and present their code, data and results, I have started to love it. Prior to that I had always documented my university assignments either …

Netflix Socks Part 1: Set up the Arduino

Some weeks ago Netflix posted a cool project on their website: Netflix socks that pause your show when you fall asleep. Since I often fall asleep while watching TV, I thought this would be a nice idea for a small project. In this post I’ll show you how to set up the Arduino for …

Inserting maps into blog posts

Since I find maps and GPS data very interesting, I like to build maps and embed them, for example here on my blog. In principle I can think of three ways to include maps in blog posts. For each of the three I will present a concrete example and explain its advantages and disadvantages. Embedding external maps: …

Data Analysis with Microsoft Excel: Tables

My article is available in German on ambassadorbase.at. In my job, and in the studies I recently finished, I work with lots of different data sources, and you will also meet all of them throughout your career as a data scientist. Data can be given to you as an SQL dump, XML files and many other …

Finding data sets Part 2: TV, music, book ratings and sports data

The first part gave a more general overview of where to get data. This part gives you specific data sources, e.g. if you like sports, movies, books and so on. Over the next couple of weeks you’ll find these posts on my blog: General data sources; TV, music, book ratings and sports data; …

Finding data sets Part 1: General data sources

I often encounter interesting algorithms or R packages that I want to test. The nice ones provide data for testing, but often it is only dummy data. To get a good understanding of a method and its limitations, real data might be required. Sometimes I would also like to explore data I have not used …

Embedding privacy-compliant social media buttons with “Shariff”

It has always bothered me that Facebook and co. track user behaviour with their Like and Share buttons, but never enough to give up a plugin that automatically embeds all kinds of social media buttons for me. Recently consumer advocates won a victory (zeit.de reported): anyone who embeds a “listening” Facebook button must, with regard to their users, …

Deriving the Predicted Residual Sum of Squares Statistic

Recently I was looking into measures for evaluating a regularized least squares model. I wanted cross-validation so that I could compare different models. While researching the options, I discovered PRESS (the Predicted Residual Sum of Squares statistic). The main resources I used to study the subject are: [1] Adrien Bartoli: Maximizing the …

Easy and efficient way to log overwriting of a directory in SQL Server Integration Services (SSIS)

Recently I had a File System Task that moved a file, but whenever the file already existed at the destination the package failed. So I set OverwriteDestination to TRUE. But then I completely lost track of which files were simply moved and which overwrote an already existing directory. My desired result …