About the talk:
As deep learning classifiers continue to mature, model providers with sufficient data and computation resources are exploring approaches to monetize the development of increasingly powerful models. Licensing models is a promising approach, but requires a robust tool for owners to claim ownership of models, i.e. a watermark. Unfortunately, current watermarks are all vulnerable to piracy attacks, where attackers embed forged watermarks into a model to dispute ownership. We believe properties of persistence and piracy resistance are critical to watermarks, but are fundamentally at odds with the current way models are trained and tuned.
In this work, we propose two new training techniques (out-of-bound values and null-embedding) that provide persistence and limit the training of certain inputs into trained models. We then introduce wonder filters, a new primitive that embeds a persistent bit-sequence into a model, but only at initial training time. Wonder filters enable model owners to embed a bit-sequence generated from their private keys into a model at training time. Attackers cannot remove wonder filters via tuning, and cannot add their own filters to pretrained models. We provide analytical proofs of key properties, and experimentally validate them over a variety of tasks and models. Finally, we explore a number of adaptive counter-measures, and show our watermark remains robust.
A pre-print paper describing this work has been posted to arxiv.org.
Slides for the talk in PDF format for Download.
To access the live webcast of the talk (active at 16:28 of the day of the presentation) and the archived version of the talk, use the URL SU-EE380-201030. This is a first class reference and can be transmitted by email, Twitter, etc. BUT the panopto link will only be active until the YouTube link is available.
A URL referencing a YouTube view of the lecture will be posted to HERE a week or so following the presentation.
About the Speaker:
|Huiying Li is a PhD student in Department of Computer Science at University of Chicago. She works in SAND Lab, co-advised by Prof. Ben Y. Zhao and Prof. Heather Zheng. Her research interest is security in Machine Learning especially Deep Learning. She is also interested in security and privacy of Networks and Systems such as Online Social Networks (OSNs) and Internet of Things (IoT).|
|Emily Wegner (nee Willson) is a PhD student at the University of Chicago, studying under Dr. Ben Zhao. Her research interests include adversarial machine learning and applied security. Previously, she worked as a mathematician for the Department of Defense.|
Huiying Li huiyingli(at)cs.uchicago.edu Emily Wegner (nee Willson) ewillsons(at)cs.uchicago.edu