PHYS 576 A: Selected Topics in Experimental Physics

TTh 12:00pm - 1:20pm
PAB B143
Miguel Morales

Welcome to Modern Data Analysis Techniques

Team-taught by Miguel Morales (Physics) and Bryna Hazelton (eScience), this class introduces current techniques and best practices in the statistically rigorous analysis of large data sets. The class is organized around four themes: practical statistics, advanced data visualization, collaborative analysis code, and advanced data analysis practices.


Everyone learns much better in person, so please come to class in person whenever possible. The room is a new space, B143, across from the SPS lounge. That said, many advanced students need to travel for research and COVID is still around, so we will offer Zoom on demand and will endeavor to record the classes. Send Miguel an email if you want Zoom for a class; this will be the link we use.

Office Hours

Miguel Morales, Monday 1:30-2:30, plus by appointment or opportunity. C325.

Bryna Hazelton, Thursday 2-3 or by appointment. C-wing 6th floor (eScience Institute).


This is a graduate elective, so what you get out of the course largely depends on what you put into it. Further, this class is designed to scale depending on your interests and time. At one end, it is designed to provide motivated students with a firm grounding in advanced statistics and data analysis tools that can be used on a wide range of academic and professional problems. At the other end, it is designed to serve as a low-pressure survey of modern analysis techniques. During the first week you will detail your goals, and your grade will be based on how well you achieve them. There will be no exams; the homework and final project form the basis of your grade.


Themes:  Practical Statistics; Data Visualization; Collaborative Analysis; Advanced Data Analysis Practices

Week 1

Th:  Welcome; course overview; what does sigma mean? video, slides 

Homework: Intro quiz

Week 2

T:  Introduction to git & GitHub  video, slides

Th:  Statistical building blocks (video; slides)

Homework: Homework #1 (git game)

Week 3

T: Data visualization pt. 1;  (video; slides)

Th:  No class

Homework:  Homework #2 (intro to stats)

Stats reference:  

Statistics cheat sheet

Online reference chart

Song paper

Wikipedia entries can be useful; look under ‘related distributions’

Week 4

T: Data visualization pt. 2, workshopping plots, analysis plans, worry lists; (video; slides)

Th: Trials factors; parameter distributions; (video; slides)

Homework: Homework #3

Week 5

T:  Parameters cont.; Fisher matrix; triangle plots; variable backgrounds; (video; slides)

Th:  Statistically valid plots; jackknife tests;  (video; slides)

Week 6

T:  Developing an analysis plan; (video; slides)

Th:  Confidence intervals; (video; slides)

Week 7

T:  Metadata, Provenance & Test Thickets; (video; slides)

Th:  Stats mini-review; the blob, analysis dragons; (video, slides)

Week 8

T: Deconvolution/forward modeling; ML overview (video, slides)

Th:  Machine Learning (Sam Tetef); plots as a language (Sam's slides, Miguel's slides, video)

Week 9

T: Blind & semi-blind analyses; data rampages (video, slides)

Th: Thanksgiving

Week 10

T:  Presentations:  Jordan Fonseca; Charles Cardot (video)

Th:  Presentations:  Omar Beesley; Cautionary examples of statistical errors (video; slides)

Week 11

T:  Presentations:  Michaela Guzzetti; Akira Pfeffer; Chris Munley (video)

Th:  Presentations:  Caio Nascimento; Valeria Hurtado; Murali Saravanan


June 1, 2022 - 11:25pm