Tech Media news · AI Feed

Eugene Yan Tech Media June 22, 2025

Evaluating Long-Context Question & Answer Systems

Evaluation metrics, how to build eval datasets, eval methodology, and a review of several benchmarks.

Eugene Yan Tech Media June 4, 2025

AI Engineer 2025 – Improving RecSys & Search with LLM techniques

Recsys & search are converging with LLMs via semantic IDs, data augmentation, and unified foundation models.

Eugene Yan Tech Media May 18, 2025

Exceptional Leadership: Some Qualities, Behaviors, and Styles

What makes a good leader? What do good leaders do? And commando, soldier, and police leadership.

Eugene Yan Tech Media May 4, 2025

Building News Agents for Daily News Recaps with MCP, Q, and tmux

Learning to automate simple agentic workflows with Amazon Q CLI, Anthropic MCP, and tmux.

Lilian Weng Tech Media May 1, 2025

Why We Think

Special thanks to John Schulman for a lot of super valuable feedback and direct edits on this post. Test time compute (Graves et al. 2016, Ling,…

Eugene Yan Tech Media April 20, 2025

An LLM-as-Judge Won't Save The Product—Fixing Your Process Will

Applying the scientific method, building via eval-driven development, and monitoring AI output.

Eugene Yan Tech Media March 30, 2025

Frequently Asked Questions about My Writing Process

How I started, why I write, who I write for, how I write, and more.

Jay Alammar Tech Media March 26, 2025

Moving To Substack

I’m freezing this blog and starting to post on my Substack instead. The authoring experience is much more convenient for me there. Please follow me there,…

Eugene Yan Tech Media March 18, 2025

NVIDIA GTC 2025 – Building LLM-Powered Applications

Chip Huyen and I share what we've learned, best practices, and insights at NVIDIA GTC 2025.

Eugene Yan Tech Media March 16, 2025

Improving Recommendation Systems & Search in the Age of LLMs

Model architectures, data generation, training paradigms, and unified frameworks inspired by LLMs.

Chip Huyen Tech Media January 16, 2025

Common pitfalls when building generative AI applications

As we’re still in the early days of building applications with foundation models, it’s normal to make mistakes. This is a quick note with examples of…

Eugene Yan Tech Media January 12, 2025

Building AI Reading Club: Features & Behind the Scenes

Exploring how an AI-powered reading experience could look like.

Chip Huyen Tech Media January 7, 2025

Agents

Intelligent agents are considered by many to be the ultimate goal of AI. The classic book by Stuart Russell and Peter Norvig, Artificial Intelligence: A Modern…

Eugene Yan Tech Media December 22, 2024

2024 Year in Review

A peaceful year of steady progress on my craft and health.

Hamel Husain Tech Media December 13, 2024

nbsanity – Share Notebooks as Polished Web Pages in Seconds

nbsanity - Share Notebooks as Polished Web Pages in Seconds

Eugene Yan Tech Media December 1, 2024

Seemingly Paradoxical Rules of Writing

With regard to writing, there are many rules and also no rules at all.

Hamel Husain Tech Media November 30, 2024

Building an Audience Through Technical Writing: Strategies and Mistakes

Lilian Weng Tech Media November 28, 2024

Reward Hacking in Reinforcement Learning

Reward hacking occurs when a reinforcement learning (RL) agent exploits flaws or ambiguities in the reward function to achieve high rewards, without genuinely learning or completing…

Eugene Yan Tech Media November 24, 2024

How to Run a Weekly Paper Club (and Build a Learning Community)

Benefits of running a weekly paper club, how to start one, and how to read and facilitate papers.

Eugene Yan Tech Media November 17, 2024

My Minimal MacBook Pro Setup Guide

Setting up my new MacBook Pro from scratch

Eugene Yan Tech Media November 3, 2024

39 Lessons on Building ML Systems, Scaling, Execution, and More

ML systems, production & scaling, execution & collaboration, building for users, conference etiquette.

Hamel Husain Tech Media October 29, 2024

Using LLM-as-a-Judge For Evaluation: A Complete Guide

Eugene Yan Tech Media October 27, 2024

AlignEval: Building an App to Make Evals Easy, Fun, and Automated

Look at and label your data, build and evaluate your LLM-evaluator, and optimize it against your labels.

Eugene Yan Tech Media September 22, 2024

Weights & Biases LLM-Evaluator Hackathon – Hackathon Judge

Being a human judge at the Weights & Biases LLM-as-a-Judge Hackathon

Tech Media 164 stories