General Data Story Instructions
Overview
In Part II of the course (weeks 5–10), you will apply the tools and
skills you learned in Part I to create your own data stories. You'll
cycle through three stages repeatedly:
- Pitching and brainstorming. Present your ideas to classmates,
- Refinement. Workshop your ideas and code. Bring up new content,
- Critiquing. As a class, we'll discuss your story and provide
form groups, and workshop story ideas.
tools, or techniques you'd like to learn more about.
constructive feedback. We'll also go through your code and discuss
relevant coding issues (challenges, clever hacks, etc.).
You should plan to make it through this cycle at least three times
during Part II — that is, you should produce at least 3 data stories.
What is a Data Story?
A data story is a short (5-minute) presentation that uses data to tell
a compelling narrative. It combines:
- A question you want to answer or explore
- Data that helps you investigate that question
- Analysis and visualizations that reveal patterns and insights
- A narrative that ties it all together for your audience
You can work individually or in groups of any size. Projects and
groups should form organically and remain flexible.
Deliverables
Each data story should be contained in a single sub-folder of
Your project should include the following files, based on the
1. README.md
Based on the
your README should contain:
- Project information: project name, authors, GitHub usernames
- Overview: main question, approach, data used, tools used, key findings
- YouTube link: a link to your 5-minute data story video
- Data links: where to find the data you analyzed
- Running the code: instructions for replicating your results
- Contributing: next steps, open questions, known issues
- Acknowledgements and citations
2. YouTube Video (5 minutes)
A video presentation of your data story. You can:
- Narrate over a screencast of your notebook
- Create slides (using Marp, PowerPoint, Keynote, etc.)
- Use any other creative format you like
The key is to tell a compelling story backed by data, not just
walk through code.
3. Code
Your project's code (Jupyter notebooks, Python scripts, etc.), based on
the notebook template.
Your code should be well-documented and reproducible.
4. Data
- If your data files are under 10 MB total, include them directly
- If larger, host them on Google Drive, Dropbox, or another accessible
- Either way, your notebook must include code for downloading and
in your project folder.
cloud source.
importing the data so anyone can reproduce your results.
Weekly Rhythm
| Day | Activity |
|---|---|
| Monday | Review and discuss data stories from the previous week |
| Wednesday | New tools, demos, and techniques |
| Thursday (X-hour) | Office hours or hackathon/demos |
| Friday | Hackathon + brainstorming for next week's stories |
Tips for Great Data Stories
- Start with a question, not a dataset. What do you want to know?
- Keep it focused. A 5-minute story should make one or two clear
- Show, don't tell. Let your visualizations carry the narrative.
- Iterate. Your first attempt won't be perfect — use feedback from
- Be creative. The best stories surprise the audience or change how
- Collaborate. Build off each others' code, questions, and ideas.
points, not try to cover everything.
classmates to improve.
they think about something.
Interdisciplinary teams often produce the most interesting work.
Finding Data
Need inspiration? Here are some places to find interesting datasets:
- Kaggle Datasets
- FiveThirtyEight Data
- Awesome Public Datasets
- World Bank Open Data
- World Health Organization
- Registry of Open Data on AWS
- Reddit: r/datasets
- Google Dataset Search
Evaluation
Each data story will be evaluated on:
| Criterion | What we're looking for |
|---|---|
| Question | Is the question interesting and well-defined? |
| Data | Is the data appropriate for the question? |
| Analysis | Is the analysis sound and well-executed? |
| Visualization | Do the figures effectively communicate insights? |
| Narrative | Does the story flow logically and engage the audience? |
| Code quality | Is the code clean, documented, and reproducible? |
| Feedback | Did you incorporate feedback from previous iterations? |
Submitting Your Story
- Create a folder in
data-stories/named descriptively (e.g., - Include your README.md, notebook(s), and data files
- Upload your video to YouTube (unlisted is fine)
- Submit a pull request to the course repository
data-stories/climate-trends/)