Readings for Week 5

Updates

  • The blogs for Class 5 and Class 6 are now posted.
  • Every project team should have received feedback on your Project Idea. The next main deliverable for the project is your Project Mini-Proposal, which is due Monday, 16 February. (This will be discussed in class on Tuesday.)

Reading for Tuesday, 10 February (repeated from Readings for Week 4)

  • Cynthia Rudin. Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nature Machine Intelligence, May 2019. [PDF Link] [arXiv version (less nicely formatted, but with fixed equations)]

Reading for Thursday, 12 February:

  • Milad Nasr, Javier Rando, Nicholas Carlini, Jonathan Hayase, Matthew Jagielski, A. Feder Cooper, Daphne Ippolito, Christopher A. Choquette-Choo, Florian Tramèr, and Katherine Lee. Scalable extraction of training data from aligned, production language models. In International Conference on Learning Representations (ICLR) 2025. ICLR Web Link. (You can also see the review discussion: ICLR Forum)

One of the authors of this paper, Matthew Jagielski (now at Anthropic), will visit UVA on Friday, 13 February and give a Distinguished Talk at 11:00am Friday, 13 February, in Rice 540.