AP Statistics · Post-Exam Project
You'll find your dataset at openintro.org/data. Every dataset there is clean, well-documented, and comes with a description of every variable. Browse until something catches your interest.
Explore the list of data sets before picking something — find something you're actually curious about!
These are the questions a statistician asks before touching any tool. Answer them in writing — you'll need them for your presentation.
Open an LLM of your choice (Large Language Model such as ChatGPT, Gemini, or Claude) and start a new conversation. Note: the minimum age requirement for Gemini and ChatGPT is 13+ while Claude is 18+. Copy the prompt below, upload your CSV, and let the LLM ask you the clarifying questions before it builds anything.
Save your file as teammate1_teammate2.html using both of your last names (e.g., chen_morgan.html). Download it from the LLM's code panel and submit it to Google Classroom by the end of Monday's class. This is your checkpoint — it doesn't need to be finished, but it needs to run correctly in a browser.
Also submit your chat transcript from today's session. Export or copy it as a PDF or shared link and attach it to the same Classroom submission. The transcript is evidence that you were directing the work — it should show you asking questions, pushing back, and making decisions, not just accepting whatever the LLM produced first.
This is the part that makes your visualization worth presenting. Your job on Thursday is to add at least one interactive layer that paper and pencil simply cannot do — something that lets a viewer explore your data rather than just look at a picture of it.
Before you ask the LLM to build anything, decide with your group what you want and why. The feature should serve the questions you wrote down in Phase 1, not just look impressive. Be ready to explain that connection on Friday.
By the start of Thursday's class, submit your updated teammate1_teammate2.html to Google Classroom. This is the version that will be hosted in the class gallery at mathclass.today — make sure it opens correctly in a browser with no internet connection required.
Also submit your Thursday chat transcript alongside the file. Together, your Monday and Thursday transcripts are the paper trail for your decisions. If you're asked on Friday why you chose a particular chart type, interaction, or color scheme, your transcripts should back up your answer — they should show you proposing ideas, evaluating options, and steering the tool, not just running the LLM's first suggestion.
Presentations are on Friday, May 15. Each pair has exactly 5 minutes. You'll open your HTML file in the browser and walk the class through your data story. You are not summarizing your process — you are making an argument about what your data shows.
| What We're Looking For | What It Means | Pts |
|---|---|---|
| Statistical accuracy | Chart type matches data type; labels and scales are correct; sample/population distinction is stated clearly | 2 |
| Interactive layer + decision | Goes meaningfully beyond a paper chart; you can explain why you added it and what it reveals | 3 |
| Inference | Correct procedure chosen; conditions addressed; result interpreted in context of your question | 3 |
| Data story + finding | Visual and inference connect to a clear, specific claim about your data | 1 |
| Polish | Fonts are readable, axis labels are clear and correctly sized, titles are accurate, no typographical errors — the visualization looks finished | 1 |
| Presentation | 5-minute limit respected; both partners speak; demo is live and working | 1 |
| Chat transcripts | All transcripts submitted; they show you directing the AI — proposing ideas, making decisions, and pushing back — not just accepting the first output | 1 |