2023: The Year LLMs Ate the World
Looking back on a year that transformed AI from a research curiosity into a force reshaping every industry
Thoughts on engineering leadership, distributed systems, architecture, and the craft of building software.
Looking back on a year that transformed AI from a research curiosity into a force reshaping every industry
Mistral AI's Mixtral model demonstrates that mixture of experts architectures can deliver frontier-class performance efficiently
Google launches Gemini, its most capable AI model, built from the ground up for multimodal reasoning
AWS re:Invent 2023 puts generative AI at the center of Amazon's cloud strategy with Bedrock and new services
Sam Altman's firing and rehiring at OpenAI exposes the tensions at the heart of AI development
OpenAI's first developer conference reveals its vision for an AI application platform
Documenting my journey from infrastructure engineer to hands-on AI researcher and builder
The emergence of structured tool use and function calling in LLMs points toward a protocol-driven future for AI integration
Llama 2's open release changes the dynamics of AI development and gives enterprises new deployment options
The AI framework ecosystem is exploding with tools for building LLM-powered applications
Anthropic's Claude model and Constitutional AI represent a fundamentally different philosophy in the AI race
After years in cloud infrastructure, I am making a deliberate shift toward AI and large language models
AutoGPT introduces the concept of autonomous AI agents that can decompose tasks and execute multi-step plans
GPT-4 launches with multimodal capabilities, passing professional exams and setting a new benchmark for AI
Google announces Bard in response to ChatGPT, and the large language model race officially begins
Microsoft's massive investment in OpenAI signals a new era of AI competition among the tech giants