Omurice's Memorandum (オムライスの備忘録)

Notes on mathematics, statistics, machine learning, and programming

[Multimodal] ICMLM


Index

ICMLM
References

ICMLM

ICMLM is a study of pre-training for image tasks.

Pre Training
- yhayato1320.hatenablog.com

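It adds visual information to the Masked Language Model: masked words in a caption are predicted from the image as well as from the surrounding text.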

Natural Language Supervision
- yhayato1320.hatenablog.com

ICMLM stands for Image-Conditioned Masked Language Modeling.
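
To make the idea concrete, here is a minimal toy sketch in plain Python of predicting a masked caption word from both the text context and an image feature. It is not the architecture from the referenced paper. The random embeddings, the mean-pooled text context, the element-wise addition used as fusion, and all names (`mask_caption`, `icmlm_loss`, `image_feature`) are assumptions made purely for illustration.

```python
import math
import random

MASK = "[MASK]"


def mask_caption(tokens, index):
    """Return a copy of `tokens` with position `index` masked, plus the hidden target word."""
    masked = list(tokens)
    target = masked[index]
    masked[index] = MASK
    return masked, target


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def icmlm_loss(tokens, mask_index, image_feature, embeddings, vocab):
    """Cross-entropy for predicting the masked word from text context plus image feature."""
    masked, target = mask_caption(tokens, mask_index)
    dim = len(image_feature)
    # Text context: mean embedding of the masked caption (toy stand-in for a language model).
    context = [sum(embeddings[t][d] for t in masked) / len(masked) for d in range(dim)]
    # Image conditioning: toy fusion by element-wise addition (an assumption, not the paper's method).
    fused = [c + v for c, v in zip(context, image_feature)]
    # Score every vocabulary word against the fused vector and normalize.
    probs = softmax([dot(fused, embeddings[w]) for w in vocab])
    return -math.log(probs[vocab.index(target)]), probs


rng = random.Random(0)
vocab = ["a", "dog", "cat", "runs", "sleeps"]
dim = 4
# Random embeddings stand in for learned parameters.
embeddings = {w: [rng.uniform(-1, 1) for _ in range(dim)] for w in vocab + [MASK]}
caption = ["a", "dog", "runs"]
# Hypothetical image feature, e.g. the pooled output of a CNN.
image_feature = [rng.uniform(-1, 1) for _ in range(dim)]

loss, probs = icmlm_loss(caption, 1, image_feature, embeddings, vocab)
print(f"loss for the masked word: {loss:.4f}")
```

In actual pre-training, this loss would be minimized with respect to the image encoder's parameters as well, so that the visual features become useful for filling in the masked words. The sketch only evaluates the loss once and does no training.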

References

Learning Visual Representations with Caption Annotations
- [2020]
- Abstract
- arxiv.org