Data Dojo Würzburg 2
July 2021
- When: July 8th, 2021 at 6:00pm
- Where: Zoom
- Zoom:
- Link
- Meeting ID: 985 2364 3600
- Password: 175005
- Info: DataDojo Website, Repo
Collaborative Tools and Workflow
We have not found the ideal solution, yet. Some options are available as preview features but not stable, yet. So this time we will likely start with plain Zoom screen sharing, we’ll switch to something different based on participant interest: probably CoCalc.
Real time collaboration tools
Feel free to add suggestions to this list
- VS Code with Live Share Extension (very promising but notebook support currently broken), languages: python, R, julia, …
Unfortunately, notebook support with VS Code Live Share is an experimental feature which is currently broken (but expected to be fixed soon): https://docs.microsoft.com/en-us/visualstudio/liveshare/reference/notebooks and https://github.com/MicrosoftDocs/live-share/issues/4291. Update: on June 22, this issue was anounced as fixed. Unfortunately there is a new (unrelated) issue that prevents notebook output from being shown for guests: https://github.com/MicrosoftDocs/live-share/issues/4342. This issue is being worked on...
Once this is available, it seams to be the ideal solution especially as it has a unified interface independent of the language (python/julia/R/...)
> If notebook support is fixed the [VS Code Insiders](https://code.visualstudio.com/insiders/) version would be required as long as it is experimental. This can be installed alongside regular VS Code without problems.
- Jupyter Lab real time collaboration (alpha feature), languages: python, R, julia, …
- Observable multiplayer (experimental feature), languages: javascript
Dataset
– suggested by Timo, thanks!
Potential Questions:
- Comparison of regions/countries/continents
- Comparison of plant-types/energy sources
- Find additional data to map MW demand vs. production (per region/country/continent)
Question Pool:
- Generic
- What kind of information is stored in the table?
- How much data is missing?
- Is the dataset clean or are there any clear outliers?
- Specific
- Which country has the highest number of powerplants (how many, top 10)?
- Which is the country with the highest power preduction (MW) per continent (how much, top 5)
- What is the overall proportion of each energy source (by plant number, by MW produced)
- Which is the greenest country (highest fraction of power from renewable sources)?
- How does the distribution of power plants look on a worldmap (by type, interactive)?
Participants
Please add your name to the list if you plan to come. And please remove it if you can not make it.
- Markus
- Timo
- Stefan -
Future Suggestions
Add your suggestions to the list and :+1: to the end of a line you are interested in
Datasets
- https://raw.githubusercontent.com/wri/global-power-plant-database/master/output_database/global_power_plant_database.csv
Kinds of Questions
- Comparison of regions/countries/continents
- Comparison of plant-types/energy sources
- Find additional data to map MW demand vs. production (per region/country/continent)
Tools/Languages
- R/tidyverse
- python
- Excel
- veezoo
- Power BI
- javascript
Skills
- interactive maps
- dashboards
- animations