CMSC848O
Selected Topics in Information Processing; Long-Context Language Models
Focuses on recent developments in training, aligning, and evaluating long-context language models, which have allowed cutting-edge LLMs to process and generate millions of words. Topics include neural architectures (e.g., Transformers, Mamba), extended context fine-tuning/upscaling, and tasks such as summarization and QA over books.
Sister Courses: CMSC848B, CMSC848C, CMSC848D, CMSC848E, CMSC848F, CMSC848G, CMSC848I, CMSC848J, CMSC848K, CMSC848M, CMSC848Q, CMSC848Z
Spring 2025
0 reviews
Average rating: N/A