BERT-FP | Notion

논문명 : Fine-grained Post-training for Improving Retrieval-based Dialogue Systems, NAACL 2021

PLM을 이용해서 멀티턴 응답선택을 fine-tuning 하는 연구 (fine-grained)

어떤 PLM을 쓸까? → BERT or RoBERTa

기존 PLM을 바로 쓰지 않고, post-training을 통해 테스크에 맞는 PLM을 만들어서 사용하자.

즉 pre-training → post-training → fine-tuning

기존 학습 상황

context → response후보 중 선택

[context; response] → 0 or 1

논문

short context-response pairs 활용

Untitled

Short-context
- 기존의 컨텍스트에서 학습 데이터를 새로 구성
Candidate class
1. positive
2. 랜덤으로 추출한 negative
3. 그럴싸한데 아닌 false Negative (generate 또는 다양한 방법으로 생성)
4. 컨텍스트에서 추출한 negative
위의 방식으로 구성된 데이터로 모델 을 학습한다.