Princeton NLP Blog

2023

MeZO: Fine-Tuning Language Models with Just Forward Passes

6 minute read

A memory-efficient method to fine-tune LMs.

FireAct: Toward Language Agent Fine-tuning

9 minute read

Blog Post by Baian Chen and Shunyu Yao.

Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

4 minute read

Blog Post by Mengzhou Xia and Tianyu Gao from Princeton University.

Flash-Decoding for Long-Context Inference

6 minute read

Blog Post by Tri Dao (Princeton), Daniel Haziza (Meta), Francisco Massa (Meta), and Grigory Sizov (Meta).

Language Agents in the Digital World: Opportunities and Risks

12 minute read

Blog Post by Shunyu Yao and Karthik Narasimhan.

FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning

7 minute read

Blog Post by Professor Tri Dao.

The Socratic Method for Self-Discovery in Large Language Models

29 minute read

Is there a Theory of Anamnesis of Large Language Models?

2021

Phrase Retrieval and Beyond

17 minute read

Blog Post by Jinhyuk Lee.

Prompting: Better Ways of Using Language Models for NLP Tasks

21 minute read

Blog Post by Tianyu Gao.
