Data Dojo Würzburg 8
January 2022
- When: Thursday, January 13th, 2022 at 6:00pm
- Where: Zoom
- Zoom: The event has ended
- Info: DataDojo Website, Repo
Participants
Please add your name to the list (click the pen icon at the top left to edit) if you plan to come. And please remove it if you can not make it. Feel free to add your preferred tool or programming language.
- Markus (julia)
- Robin R. (R/julia, maybe python)
- »add your name here«
Dataset
Results of the German Federal Election including local results from Würzburg (Stadt and Landkreis)
Question Pool:
- Generic
- What kind of information is stored in the table(s)?
- How much data is missing?
- Is the dataset clean or are there any clear outliers?
- Specific
- In how many Wahlkreisen did the Greens lose votes compared to 2017?
- In how many Wahlkreisen did the AfD lose votes compared to 2017?
- What is the fraction of Wahlkreise for each party in which they did not reach 5%?
- Which party had the strongest decline/increase in any federal state?
- Which party had the most extreme results? → highest variance in votes by Wahlkreis
- Are the Pirates still relevant in any Wahlkreis/State?
- Which party below 5% had the best result in any Ort/Stadtteil around Würzburg?
- Which was the strongest below-5%-party per Ort/Stadtteil?
- Which Orte/Stadtteile voted most similar/dissimilar?
- Visualize the results of the local votes in a suitable way :stuck_out_tongue_winking_eye:
- Add your own questions
- Further Ideas
- Can we link the results to other demographic information (e.g. mean age, gender distribution, …, data for Würzburg Stadt, more)
- Show results with district resolution on an interactive map (e.g using these shapes)
- Add your own ideas
- Inspiration
Collaborative Tools and Workflow
For Notebooks (R, python, julia, js, …) with real time collaboration CoCalc seems to be the best option right now. It worked great the last couple of times so we’ll stick to it for now. You need to register an account there (it is free).
Future Suggestions
Add your suggestions to the list and :+1: to the end of a line you are interested in
Data Sets
- Results of the Bundestagswahl 2021
- Weather data throughout Germany over time (incl. temperature, precipitation, …): https://www.dwd.de/DE/leistungen/cdc_portal/cdc_portal.html
- German Mikrozensus
- Kaggle Titanic or Tabular Playground or Meta Kaggle
- World Trade Data (Open Trade Statistics)
- Open Citation Data
- Top 100 charts + Audio Features
- Emoji Usage :hugging_face::heart::laughing:
Tools/Languages
Skills
- interactive maps
- dashboards
- animations
Data Sources
all data types are welcome, including tables, images, videos, sounds, DNA, …
- TidyTuesday
- Our World in Data (R package: owidR), Sustainable Development Goals
- Open Data Initiatives (Würzburg, Germany, Statistisches Bundesamt, Europe, APIs)
- Awesome Public Datasets
- Kaggle Datasets or Competitions, e.g. SLICED
- tsibbledata: Time Series Datasets
- R-text-data: Text Datasets, ready to use in R
- data.world
- Statista - the University of Würzburg has a campus license
- Open Legal Data
- Bundestag Data (e.g. poll results, deputies, wahl-o-mat, inspirational blog post)
- Deutsche Digitale Bibliothek (API, old newspapers from Germany)
- Earth Observation: Satellite Image Time Series
- Machine Learning Datasets
- Internation (Student) Assessment Data (TIMSS, PIRLS, PISA, …)
- (Medical) Imaging Datasets, MedMNIST