Index

Large Language Model / LLM
Algorithms
Techniques and Tricks
Research
Services and Applications
References
- Books
- Websites
  - Tweet
- Videos

Large Language Model / LLM

Natural Language Processing Algorithms #Summary
- Algorithms using deep learning
- yhayato1320.hatenablog.com

Algorithms

LaMDA / 2022

LaMDA
- yhayato1320.hatenablog.com

LM-DESIGN / 2023

Structure-informed Language Models Are Protein Designers
- [2023]
- arxiv.org

Clinical-T5 / 2023

Do We Still Need Clinical Language Models?
- [2023]
- arxiv.org

LEALLA / 2023

LEALLA: Learning Lightweight Language-agnostic Sentence Embeddings with Knowledge Distillation
- [2023]
- arxiv.org

GLM-Dialog / 2023

Chinese?

GLM-Dialog: Noise-tolerant Pre-training for Knowledge-grounded Dialogue Generation
- [2023]
- arxiv.org

BitNet / 2023

BitNet b1.58 / 2024

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
- [2024]
- arxiv.org
Microsoft releases a 1.58-bit large language model; matrix computation can be done with additions, sharply cutting compute cost
- gigazine.net
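
The "matrix computation becomes addition" point can be illustrated with a minimal sketch. It assumes, as in the BitNet b1.58 paper, that every weight is one of -1, 0, or +1 (three values, hence log2(3) ≈ 1.58 bits). The function names and the toy matrix are hypothetical, chosen only for illustration, and this is not the paper's actual kernel.

```python
def ternary_matvec(weights, x):
    """Multiply a {-1, 0, +1} matrix by a vector using only additions and subtractions."""
    out = []
    for row in weights:
        acc = 0.0
        for w, xi in zip(row, x):
            if w == 1:
                acc += xi      # +1 weight: add the activation
            elif w == -1:
                acc -= xi      # -1 weight: subtract the activation
            # 0 weight: the activation is skipped entirely
        out.append(acc)
    return out


def dense_matvec(weights, x):
    """Reference implementation with ordinary multiply-accumulate."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weights]


W = [[1, 0, -1], [-1, 1, 1]]   # hypothetical ternary weight matrix
x = [0.5, 2.0, -1.5]           # hypothetical activations
print(ternary_matvec(W, x))    # [2.0, 0.0]
```

Both functions return the same result for ternary weights, but the first never multiplies, which is where the cost reduction comes from.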

Techniques and Tricks

Expert LM vs MultiTask LM

Expert LM / 2023

Exploring the Benefits of Training Expert Language Models over Instruction Tuning
- [2023]
- arxiv.org

Open AGI

OpenAGI: When LLM Meets Domain Experts
- [2023]
- arxiv.org
- github.com

Prompting

Techniques applied to the model's input.

Prompting
- yhayato1320.hatenablog.com

Reinforcement Learning

Reinforcement Learning
- yhayato1320.hatenablog.com

Fine Tuning

Parameter Efficient Fine Tuning / PEFT

Parameter Efficient Fine Tuning / PEFT
- yhayato1320.hatenablog.com

LLM-Adapters / 2023

LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models
- [2023]
- arxiv.org
- github.com

LIMA: Less Is More for Alignment

LIMA: Less Is More for Alignment
- [2023]
- arxiv.org
[2023/05/24] Machine Learning reading group
- github.com

Instruction Tuning

Instruction Tuning
- yhayato1320.hatenablog.com

Research

Fine-tuning language models to find agreement among humans with diverse preferences
- [2023]
- arxiv.org

Pre Training

Unnatural Instructions / 2023

Unnatural Instructions: Tuning Language Models with (Almost) No Human Labor
- [2023]
- arxiv.org

Alignment

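While LLMs can adapt to a wide variety of tasks, the quality and accuracy of the text they generate are not necessarily optimal.

This is where the concept of LLM alignment comes in.

Alignment covers not only understanding the user's instructions and responding appropriately, but also requirements such as not violating social norms.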

Alignment
- yhayato1320.hatenablog.com

Watermark / 2023

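Research on embedding a "watermark" so that text generated by a language model can be identified as such.

A banned list and an allowed list of tokens are created, and tokens are sampled only from the allowed list.

Human-written text ends up containing tokens from the banned list, so a statistical measure can tell whether a text was written by a human.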

A Watermark for Large Language Models
- [2023]
- arxiv.org
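
The allowed-list idea above can be sketched in a few lines of Python. This is a toy under stated assumptions rather than the paper's implementation: the vocabulary, the uniform sampler standing in for a language model, and the names `allowed_list`, `generate_watermarked`, and `z_score` are all hypothetical. Keying the split on the previous token and scoring with a z-statistic follow the paper's general description, with the allowed fraction `GAMMA` fixed at 0.5 for illustration.

```python
import hashlib
import math
import random

VOCAB = [f"tok{i}" for i in range(50)]  # hypothetical toy vocabulary
GAMMA = 0.5  # assumed fraction of the vocabulary placed on the allowed list


def allowed_list(prev_token: str) -> set[str]:
    """Deterministically split the vocabulary, keyed on the previous token."""
    digest = hashlib.sha256(prev_token.encode()).digest()
    seed = int.from_bytes(digest[:8], "big")
    shuffled = VOCAB[:]
    random.Random(seed).shuffle(shuffled)
    return set(shuffled[: int(len(VOCAB) * GAMMA)])


def generate_watermarked(length: int, rng: random.Random) -> list[str]:
    """Stand-in for a language model: sample uniformly, but only from the allowed list."""
    tokens = ["<s>"]
    for _ in range(length):
        tokens.append(rng.choice(sorted(allowed_list(tokens[-1]))))
    return tokens[1:]


def z_score(tokens: list[str]) -> float:
    """Measure how far the allowed-token count exceeds what unmarked text would give."""
    prev, hits = "<s>", 0
    for tok in tokens:
        hits += tok in allowed_list(prev)
        prev = tok
    n = len(tokens)
    return (hits - GAMMA * n) / math.sqrt(n * GAMMA * (1 - GAMMA))


rng = random.Random(0)
watermarked = generate_watermarked(200, rng)
unmarked = [rng.choice(VOCAB) for _ in range(200)]  # ignores the lists, like human text
print(f"watermarked z = {z_score(watermarked):.1f}")
print(f"unmarked    z = {z_score(unmarked):.1f}")
```

Every watermarked token lands on the allowed list, so its z-score is large, while text that ignores the lists hits the allowed list only about half the time and scores near zero. A detector would flag text whose score exceeds some chosen threshold.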

Conversational APR / 2023

Conversational Automated Program Repair
- [2023]
- arxiv.org

Hindsight Instruction Relabeling / HIR / 2023

The Wisdom of Hindsight Makes Language Models Better Instruction Followers
- [2023]
- arxiv.org

Gradient-Based Automated Iterative Recovery / GBAIR / 2023

Gradient-Based Automated Iterative Recovery for Parameter-Efficient Tuning
- [2023]
- arxiv.org

RECITation-augmented gEneration / RECITE / 2023

Recitation-Augmented Language Models
- [2023]
- arxiv.org

LLM-AUGMENTER / 2023

Check Your Facts and Try Again: Improving Large Language Models with External Knowledge and Automated Feedback
- [2023]
- arxiv.org

MUX-PLMs / 2023

MUX-PLMs: Pre-training Language Models with Data Multiplexing
- [2023]
- arxiv.org

FlexGen / 2023

A high-throughput generation engine for running LLMs with limited GPU memory.

High-throughput Generative Inference of Large Language Models with a Single GPU
- [2023]
- arxiv.org
- github.com

Universal Prompt Retrieval for Improving zero-Shot Evaluation / UPRISE / 2023

UPRISE: Universal Prompt Retrieval for Improving Zero-Shot Evaluation
- [2023]
- arxiv.org

AnnoLLM / 2023

AnnoLLM: Making Large Language Models to Be Better Crowdsourced Annotators
- [2023]
- arxiv.org

Self-Refine / 2023

Self-Refine: Iterative Refinement with Self-Feedback
- [2023]
- arxiv.org

Research

Adding Instructions during Pretraining: Effective Way of Controlling Toxicity in Language Models
- [2023]
- arxiv.org
AutoBiasTest: Controllable Sentence Generation for Automated and Open-Ended Social Bias Testing in Language Models
- [2023]
- arxiv.org
LLMMaps -- A Visual Metaphor for Stratified Evaluation of Large Language Models
- [2023]
- Visualizing LLM evaluation and representation
- arxiv.org

Services and Applications

OpenICL

OpenICL: An Open-Source Framework for In-context Learning
- arxiv.org
- github.com

OpenChatKit

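An open 20B model based on GPT-NeoX and instruction-tuned. It also provides a 6B moderation model for detecting inappropriate utterances, the 43M-example dataset OIG, and the 200K-example inappropriate-utterance dataset OIG-moderation.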

OpenChatKit
- github.com

LangChain

LangChain
- yhayato1320.hatenablog.com

Dolly

Free Dolly: Introducing the World's First Truly Open Instruction-Tuned LLM
- www.databricks.com
Hello Dolly: Democratizing the magic of ChatGPT with open models
- qiita.com

Free Dolly: The world's first truly open instruction-tuned LLM
- qiita.com

Databricks announces Dolly 2.0, a free open-source large language model that can also be used commercially
- gigazine.net
Databricks releases Dolly 2.0, a ChatGPT-style large language model that is open source and commercially usable
- pc.watch.impress.co.jp
Trying out Dolly 2.0 (dolly-v2-12b), a free open-source large language model that allows commercial use
- qiita.com
Trying Dolly 2.0 on Google Colab
- note.com

StableLM

Trained Models.

StableLM: Stability AI Language Models
- github.com
- ja.stability.ai

LLM Zoo

LLM Zoo: democratizing ChatGPT
- github.com

LMQL / Language Model Query Language

LMQL (Language Model Query Language) overview
- note.com

OpenCALM-7B

Trying LoRA fine-tuning of OpenCALM-7B on Google Colab
- note.com
Practicing RLHF fine-tuning of OpenCALM with Google Colab + trlx
- note.com
kyo-takano/OpenCALM-7B
- https://huggingface.co/spaces/kyo-takano/OpenCALM-7B

CyberAgent

A quick hands-on with the CyberAgent LLM
- note.com
Evaluating fine-tuning of CyberAgent's Japanese LLM OpenCALM for use as a dialogue model
- tech.acesinc.co.jp

NLLB

No Language Left Behind
- github.com
Translating 200 languages with a single AI model: A breakthrough in high-quality machine translation
- ai.facebook.com

Galactica

Galactica
- github.com

LLM Collection

LLM Collection
- www.promptingguide.ai

PLaMo / 2023

PLaMo-13B has been released
- tech.preferred.jp
Quantizing and trying out PLaMo-13b on Colab
- note.com

Karasu / Qarasu / 2023

Karasu and Qarasu: State-of-the-art open-source Japanese LLM chatbots
- note.com
Lightblue releases the commercially usable Japanese LLMs "Karasu" and "Qarasu"
- prtimes.jp

References

Recent Advances in Natural Language Processing via Large Pre-Trained Language Models: A Survey
- [2021]
- arxiv.org
Dissociating language and thought in large language models: a cognitive perspective
- [2023]
- arxiv.org
Creating a Large Language Model of a Philosopher
- [2023]
- arxiv.org

A Survey of Large Language Models
- [2023]
- arxiv.org
Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond
- [2023]
- arxiv.org

Books

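Are Large Language Models a New Form of Intelligence?
- Are Large Language Models a New Form of Intelligence? The World ChatGPT Changed (Iwanami Science Library)
  - Author: Daisuke Okanohara
  - Iwanami Shoten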

Websites

Large Language Models: A New Moore's Law?
- huggingface.co
Understanding Large Language Models -- A Transformative Reading List
- sebastianraschka.com
large_language_model_training_playbook
- github.com
CS324 - Large Language Models
- stanford-cs324.github.io
The wonders and threats of large language models
- speakerdeck.com
A roundup of training code for large language models
- note.com
Awesome-LLM
- github.com
How to train and use large language models in-house
- note.com
The Practical Guides for Large Language Models
- github.com
Research
- projects.laion.ai
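Google has no moat in the AI development race! Neither does OpenAI!
- webbigdata.jp
Open Japanese large-scale models being announced one after another
- zenn.dev
What theory made large language models such as ChatGPT possible? A summary of 24 key papers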
- gigazine.net

https://note.com/npaka/n/n1d99253ae2cf

ChatGPT vs the biggest competitors:

-ChatGPT (GPT 3.5): 175B Parameters
-Bard (Google LaMDA): 137B Parameters
-Baidu Ernie: 260B Parameters
-LG Exaone: 300B Parameters
-Nvidia Megatron: 530B Parameters
-Google PaLM: 540B Parameters pic.twitter.com/tOjQzGsocN
— Rowan Cheung (@rowancheung) February 24, 2023

Videos

FY2022 AIP Symposium special lecture
- The wonders and threats of large language models
- www.youtube.com