What makes a good "impossible" question?

Asking about something completely unrelated to the dataset
Asking about a feature that isn't in the dataset (e.g., "what's their favorite color?" when no color column exists)
These are too easy to spot — the sleuth will know immediately

Ask about values or patterns that seem answerable but actually require data you don't have
The sleuth should have to analyze the data — not just glance at the columns — to figure out whether it's possible
The question should be interesting enough that the sleuth wants to answer it, even if they can't
Examples:
- You can only know if the question is answerable after running some analyses (e.g., groups A and B aren't distinguishable on any single feature, but they are distinguishable when you look at the interaction of two features)
- Features co-vary in a way that makes it impossible to disentangle their effects (e.g., all the students with high screen time also have low sleep hours, so you can't tell which one is driving any observed effects on GPA)
- The question is about a pattern that could be in the data but isn't (e.g., ask something about a subgroup of students with high GPA, high sleep hours, and low screen time when no such subgroup exists)

Creating data