Apple Health data exploration with Atlas, Clickhouse, Vega-Altair and Quarto : r/QuantifiedSelf Skip to main content

Get the Reddit app

Scan this QR code to download the app now
Or check it out in the app stores

Apple Health data exploration with Atlas, Clickhouse, Vega-Altair and Quarto

Hey everyone!

A few days ago I wrote a simple python script ("Atlas") that turns the Apple Health export.xml file (which is about 1 GB in my case, with about 10 years of data) into a very simple parquet file (a bit like a compressed CSV) that is also way smaller (40 MB).

The parquet file has 5 columns:

  • type (e.g. "CyclingDistance")

  • value (e.g. "12.100")

and 3 datetime timestamps:

  • start

  • end

  • created

This makes it way easier to do data exploration. Here are a few example charts I generated using Clickhouse (chDB) and Vega-Altair in a Quarto notebook.

r/QuantifiedSelf - Apple Health data exploration with Atlas, Clickhouse, Vega-Altair and Quarto
r/QuantifiedSelf - Apple Health data exploration with Atlas, Clickhouse, Vega-Altair and Quarto
r/QuantifiedSelf - Apple Health data exploration with Atlas, Clickhouse, Vega-Altair and Quarto
r/QuantifiedSelf - Apple Health data exploration with Atlas, Clickhouse, Vega-Altair and Quarto
r/QuantifiedSelf - Caffeine in mg
Caffeine in mg
r/QuantifiedSelf - Caffeine consumed after 17:00
Caffeine consumed after 17:00

More than happy to look into adding examples for charts that you are interested in. Atlas is on Github (⭐️ star it to stay tuned for updates!):

https://github.com/atlaslib/atlas

There I've also added screenshots for how to get the Apple Health export.xml file and also example code for how to generate charts from the parquet file.

Share
Thinking Snoo

Be the first to comment

Nobody's responded to this post yet.
Add your thoughts and get the conversation going.