[논문 리뷰] Prediction-Oriented Bayesian Active Learning

250x250

Notice

Recent Posts

Tags more

Archives

관리 메뉴

끄적거림

개인 공부 정리/Bayesian

Signing 2023. 4. 22. 17:54

728x90

2.1 Active Learning

A/L은 라벨링을 하는 작업인데, 흔히 label acquisition이라고 부른다. 라벨을 얻는 작업이니까.

acquisition을 할때 몇 가지 step이 있는데,각 step $t$는 3가지의 작업으로 나뉜다.

첫 번째, 알고리즘은 라벨링을 위해 query input $x_t$를 얻고,
주어진 input에 대해 라벨링에 필요한 utility를 포착하기 위한 acquisition function을 maximize한다.
두 번째, 알고리즘은 조건부 라벨 분포 $p(y|x=x_t)$으로부터 라벨 $y_t$을 sampling하고, 이를 training dataset에 $(x_t, ~ y_t)$를 넣는다.
세 번째, 예측모델 $p_{\phi}(y|x)$을 업데이트 한다.

2.2 Bayesian experimental design

Bayesian experimental design은 실험으로부터 정보 습득에 대해 quantifying 할 수 있는 framework이다.

A/L에서는 input $x$를 실험의 design으로 볼 수 있고, label $y$는 experiment의 결과로 볼 수 있다.

$\psi$를 학습을 통해 초점을 맞추고 있는 정보의 양이라고 정의하고, prior $p(\psi)$, likelihood $p(y|x, \psi)$로 정의하면, 실험$(x,y)$을 통해 얻는 정보의 양 $\psi$를 quantify할 수 있다.
Shannon entropy를 사용함

$y$가 random variable이기 때문에, $p_{\psi}(y|x) = E_{p(\psi)}[p(y|x,\psi)]$임을 활용해 $y$에 대한 $EIG$를 marginal predictve distribution으로 만들 수 있다.
$(x,y)$에 대한 conditioning 이후 $\psi$에서 uncertainty의 expected reduction이다.
이것은 $x$가 주어졌을 때, $y$와 $\psi$ 사이의 mutaul information인 $I(\psi;y|x)$와 동일하다.

2.3 Bayesian active learning by disagreement(BALD)

Bayesian A/L은 전통적으로 $\psi$에서 $\theta$를 세팅한 model parameter의 정보 습득을 목표로 한다.

728x90

[논문 리뷰] Prediction-Oriented Bayesian Active Learning - 3. The shortfalls of BALD (0)	2023.04.23
[논문 리뷰] Prediction-Oriented Bayesian Active Learning - 1. Introduction (0)	2023.04.22
[논문 리뷰] Prediction-Oriented Bayesian Active Learning - 0. Abstract (0)	2023.04.22
[논문 리뷰] On Uncertainty, Tempering, and Data Augmentation in Bayesian Classification - 6.Experiments (0)	2023.04.03
[논문 리뷰] On Uncertainty, Tempering, and Data Augmentation in Bayesian Classification - 4.Aleatoric Uncertainty in Bayesian Classification (0)	2023.04.01

'개인 공부 정리/Bayesian' Related Articles

Comments