Stat 406
Geoff Pleiss
Last modified – 02 September 2025
\[ \DeclareMathOperator*{\argmin}{argmin} \DeclareMathOperator*{\argmax}{argmax} \DeclareMathOperator*{\minimize}{minimize} \DeclareMathOperator*{\maximize}{maximize} \DeclareMathOperator*{\find}{find} \DeclareMathOperator{\st}{subject\,\,to} \newcommand{\E}{E} \newcommand{\Expect}[1]{\E\left[ #1 \right]} \newcommand{\Var}[1]{\mathrm{Var}\left[ #1 \right]} \newcommand{\Cov}[2]{\mathrm{Cov}\left[#1,\ #2\right]} \newcommand{\given}{\ \vert\ } \newcommand{\X}{\mathbf{X}} \newcommand{\x}{\mathbf{x}} \newcommand{\y}{\mathbf{y}} \newcommand{\P}{\mathcal{P}} \newcommand{\R}{\mathbb{R}} \newcommand{\norm}[1]{\left\lVert #1 \right\rVert} \newcommand{\snorm}[1]{\lVert #1 \rVert} \newcommand{\tr}[1]{\mbox{tr}(#1)} \newcommand{\brt}{\widehat{\beta}^R_{s}} \newcommand{\brl}{\widehat{\beta}^R_{\lambda}} \newcommand{\bls}{\widehat{\beta}_{ols}} \newcommand{\blt}{\widehat{\beta}^L_{s}} \newcommand{\bll}{\widehat{\beta}^L_{\lambda}} \newcommand{\U}{\mathbf{U}} \newcommand{\D}{\mathbf{D}} \newcommand{\V}{\mathbf{V}} \]
Geoff Pleiss
Goal:
Assumptions:
A history lesson
Statisticians are developing frameworks for reasoning, predicting, and making decision from data.
🧐 Aren’t these the same goals that the AI community has? 🧐
Each module has a technical content theme (and a statistical principles theme)
Component | Points |
---|---|
Midterm | 10 points |
Final | 30 points |
Total | 40 points |
Component | Points |
---|---|
In-Class | 15 points |
Labs | 20 points |
Homework | 40 points |
Total | 60 points |
Component | Points |
---|---|
In-Class | 15 points |
Labs | 20 points |
Homework | 40 points |
Total | 60 points |
\[\text{effort} = \min\left\{60, \text{class} + \text{lab} + \text{hw}\right\}\]
Important
8am is hard.
Attendance is optional.
If you’re sleep deprived
Getting more sleep may be beneficial to your learning in the long run
If you’re not going to engage/participate
Lectures are recorded, watch on your own time
If you’re going to have your laptop open
Do your homework/social media browsing/k-pop video watching in the comfort of your own home; then revisit the course material on your own time
What I Write for Students who Attend/Participate/Ask Questions
What I Write for Students who Just “Show Up”
I don’t get to know you, so all I can talk about is your grade.
The goal is to “Do the work”
Labs should give you practice, allow for questions with the TAs.
They are due at 2300 on the day of your lab, lightly graded.
You may do them at home, but you must submit individually (in lab, you may share submission)
Labs are lightly graded
Not easy, especially the first 2, especially if you are unfamiliar with R / Rmarkdown / ggplot
You may revise to raise your score to 7/10, see Syllabus. Only if you lose 3+ for content (penalties can’t be redeemed).
Don’t leave these for the last minute
Language/Libraries: R + Tidyverse
Submission: via Github
Important
We assume you’re familiar with these tools (R, Tidyverse, Git, Github)
If you’re not, it’s your responsibility to get up-to-speed with them.
See Canvas for tutorials / the website for resources.
You each have your own repo
You make a branch
DO NOT rename files
Make enough commits (3 for labs, 5 for HW).
Push your changes (at anytime) and make a PR against main
when done.
TAs review your work.
If you want to revise HWs, make changes in response to feedback and push to the same branch. Then “re-request review”.
Deep intuitions require struggle
The only way to develop intuitions about challenging material is to wrestle with content
Stand out from the crowd
Anyone can use Claude/Copilot for simple ML. Demonstrate thinking beyond these tools will make you more hireable/trusted
Course-specific subtleties
Even with good prompting, chatbots likely won’t score above 7-8 on assignments
No AI on exams
Don’t become too dependent on tools you can’t use during midterm/final
Self-reporting required:
If you have not submitted your lab/assignment by the time grading starts, you will get a 0.
When you submit | Likelihood that your submission gets a 0 |
---|---|
Before 11pm on due date (i.e. on time) | 0% |
11:01pm on due date | 0.01% |
9am after due date | 50% |
2 weeks after due date | 99.99999999% |
We will only make exceptions when you have grounds for academic consession. (See the UBC policy.)
Tip
Remember: you can still get a “perfect” effort grade even if you get a 0 on one assignment.
An Introduction to Statistical Learning
James, Witten, Hastie, Tibshirani, 2013, Springer, New York. (denoted [ISLR])
Available free online: http://statlearning.com/
The Elements of Statistical Learning
Hastie, Tibshirani, Friedman, 2009, Second Edition, Springer, New York. (denoted [ESL])
Also available free online: https://web.stanford.edu/~hastie/ElemStatLearn/
Coming to class – 3 hours
Reading the book – 1 hour
Labs – 1 hour
Homework – 4 hours
Study / thinking / playing – 1 hour
We will use R and we assume some background knowledge.
Suggest you use RStudio IDE
See https://ubc-stat.github.io/stat-406/ for what you need to install for the whole term.
Links to useful supplementary resources are available on the website.
Course website
All the material (slides, extra worksheets) https://ubc-stat.github.io/stat-406
Canvas (minimal)
Quiz 0, grades, course time/location info, links to videos from class
Slack
Discussion board, questions
Github
Homework / Lab submission
By EOD Tomorrow:
By Early Next Week:
UBC Stat 406 - 2024