24. Nov 2021

Provable Representation Learning: The Importance of Task Diversity and Pretext Tasks

Datum: 24. November 2021 | 16:00 – 18:00
Sprecher: Qi Lei, Princeton University
Veranstaltungsort: Zoom Link: https://istaustria.zoom.us/j/98066215937?pwd=YmZoWDAzME13dC9LU0Jwc1kzWVphQT09 Meeting ID: 980 6621 5937 Passcode: 177564

Modern machine learning models are transforming applications in various domains at the expense of a large amount of hand-labeled data. In contrast, humans and animals first establish their concepts or impressions from data observations. The learned concepts then help them to learn specific tasks with minimal external instructions. Accordingly, we argue that deep representation learning seeks a similar procedure: 1) to learn a data representation that filters out irrelevant information from the data; 2) to transfer the data representation to downstream tasks with few labeled samples and simple models. In this talk, we study two forms of representation learning: supervised pre-training from multiple tasks and self-supervised learning.

Supervised pre-training uses a large labeled source dataset to learn a representation, then trains a simple (linear) classifier on top of the representation. We prove that supervised pre-training can pool the data from all source tasks to learn a good representation that transfers to downstream tasks (possibly with covariate shift) with few labeled examples. We extensively study different settings where the representation reduces the model capacity in various ways. Self-supervised learning creates auxiliary pretext tasks that do not require labeled data to learn representations. These pretext tasks are created solely using input features, such as predicting a missing image patch, recovering the color channels of an image, or predicting missing words. Surprisingly, predicting this known information helps in learning a representation useful for downstream tasks. We prove that under an approximate conditional independence assumption, self-supervised learning provably learns representations that linearly separate downstream targets. For both frameworks, representation learning provably and drastically reduces sample complexity for downstream tasks.

Cookie	Dauer	Beschreibung
cookielawinfo-checkbox-analytics	1 Jahr	Dieses Cookie wird vom GDPR Cookie Consent Plugin gesetzt. Das Cookie wird verwendet, um die Zustimmung des/der NutzerIn für die Cookies in der Kategorie "Analyse" zu speichern.
cookielawinfo-checkbox-necessary	1 Jahr	Dieses Cookie wird vom GDPR Cookie Consent Plugin gesetzt. Das Cookie wird verwendet, um die Zustimmung des/der NutzerIn für die Cookies in der Kategorie "Notwendig" zu speichern.
CookieLawInfoConsent	1 Jahr	Dieses Cookie erscheint nur, wenn Sie Änderungen an den Kategorien vornehmen (z. B. die Kategorie "Analyse" deaktivieren).
pll_language	1 Jahr	Die ISTA-Website ist eine mehrsprachige Website. WordPress erfordert die Auswahl einer Standardsprache, die von dem Cookie pll_language von Polylang gesetzt wird, damit die Website angezeigt werden kann. Außerdem wird dieses Cookie verwendet, um sich die von dem/der NutzerIn gewählte Sprache zu merken, wenn NutzerInnen zur Website zurückkehren.
viewed_cookie_policy	1 Jahr	Das Cookie wird vom GDPR Cookie Consent Plugin gesetzt und wird verwendet, um zu speichern, ob der/die NutzerIn der Verwendung von Cookies zugestimmt hat oder nicht. Es werden keine personenbezogene Daten gespeichert.

Cookie	Dauer	Beschreibung
_pk_id*	13 Monate	Matomo Tracking-Cookie speichert eine eindeutige Benutzer-ID
_pk_ses*	13 Monate	Matomo Tracking-Cookie speichert eine eindeutige Session-ID

Provable Representation Learning: The Importance of Task Diversity and Pretext Tasks

Den ISTA Newsletter bestellen

FOLGEN SIE UNS