Omurice's Memorandum (オムライスの備忘録)

Notes on mathematics, statistics, machine learning, and programming

[Multimodal] VirTex


Index

VirTex
References
- Websites

VirTex

VirTex is a study of pretraining deep learning models for image tasks.

Pretraining
- yhayato1320.hatenablog.com

The pretrained model is then fine-tuned on downstream tasks to improve accuracy.
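As a rough illustration of the pretrain-then-fine-tune flow, here is a minimal sketch in plain Python. It is not VirTex's architecture: the "backbone" is a toy linear feature, and the pretext task, the downstream task, and every name (`features`, `train`, `freeze_backbone`, and so on) are hypothetical choices made only for this example.

```python
import random

def features(backbone, x):
    # Stand-in for a visual encoder: one linear feature of the input.
    return sum(w * xi for w, xi in zip(backbone, x))

def mse(backbone, head, data):
    # Mean squared error of head[0] * feature + head[1] against the targets.
    return sum((head[0] * features(backbone, x) + head[1] - y) ** 2
               for x, y in data) / len(data)

def train(backbone, head, data, lr=0.05, epochs=200, freeze_backbone=False):
    # Plain per-sample gradient descent on the squared error.
    backbone, head = list(backbone), list(head)
    for _ in range(epochs):
        for x, y in data:
            f = features(backbone, x)
            err = head[0] * f + head[1] - y
            if not freeze_backbone:
                backbone = [w - lr * err * head[0] * xi
                            for w, xi in zip(backbone, x)]
            head[0] -= lr * err * f
            head[1] -= lr * err
    return backbone, head

rng = random.Random(0)

def sample(n, target):
    points = [(rng.uniform(-1, 1), rng.uniform(-1, 1)) for _ in range(n)]
    return [(x, target(x)) for x in points]

# Hypothetical pretext task with plenty of data.
pretext_data = sample(200, lambda x: x[0] + x[1])
# Hypothetical downstream task with only a few labelled examples.
downstream_data = sample(8, lambda x: -3.0 * (x[0] + x[1]) + 1.0)

# 1) Pretraining: learn the backbone (and a throwaway head) on the pretext task.
init_backbone, init_head = [0.1, -0.1], [1.0, 0.0]
pretrained_backbone, pretrained_head = train(init_backbone, init_head, pretext_data)

# 2) Fine-tuning: reuse the pretrained backbone, attach a fresh head,
#    and train on the downstream task (backbone kept frozen here).
new_head = [0.0, 0.0]
loss_before = mse(pretrained_backbone, new_head, downstream_data)
finetuned_backbone, finetuned_head = train(
    pretrained_backbone, new_head, downstream_data, freeze_backbone=True)
loss_after = mse(finetuned_backbone, finetuned_head, downstream_data)

print(f"downstream MSE before: {loss_before:.4f}, after: {loss_after:.4f}")
```

With `freeze_backbone=True` only the new head is trained, which is closer to a linear probe. Passing `freeze_backbone=False` would also update the backbone weights, which is what full fine-tuning usually means.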

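In addition, the visual representations are learned from natural language, that is, from text annotations attached to images.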

Natural Language Supervision
- yhayato1320.hatenablog.com

The name VirTex stands for Visual Representations from Textual annotations.

References

VirTex: Learning Visual Representations from Textual Annotations
- [2020]
- Abstract
- 1 Introduction
- arxiv.org

Websites

VirTex: Learning Visual Representations from Textual Annotations
- cvpaper.challenge
- xpaperchallenge.org