Build a Content Agent

Design an AI agent that discovers, evaluates, and curates content at scale

BUILD LAB ~30 min

What You'll Build

Content agents power recommendation systems, search platforms, and knowledge bases at scale. They combine retrieval, evaluation, and synthesis to find and present relevant information.

Learning Outcomes

Design a multi-step agent pipeline for content discovery
Implement evaluation criteria that scale with content volume
Build feedback loops that improve relevance over time
Handle content quality and source credibility assessment
Integrate user context to personalize agent behavior

Portfolio Artifacts

In this lab, you'll create:

A content discovery pipeline showing how an agent retrieves candidate content, scores relevance, and filters by quality threshold before presentation to users.

Evaluation rubric documentation that defines how the agent assesses content credibility, freshness, and alignment with user interests.

Failure analysis report identifying where content agents go wrong: ranking errors, source bias, over-filtering, or insufficient personalization.

The Scenario

📰 News Personalization at Scale

You're building a content agent for a news aggregation platform. The agent ingests thousands of articles daily, evaluates them against user interests and credibility standards, and ranks the top stories for each user.

The challenge: Your agent is flooding users with feel-good stories because they're easy to find and highly shareable—but missing critical hard news that users actually need. Meanwhile, some credible but niche sources are getting filtered out entirely.

The question: How do you design an agent that balances engagement metrics with editorial integrity?

Breaking It Down

Retrieval Stage: The agent must search a content corpus using user interests as queries. But broad searches return too much; narrow searches miss important context.

Evaluation Stage: Scoring content for relevance, credibility, freshness, and bias is complex. Each dimension conflicts with the others sometimes.

Ranking & Filtering: The agent decides which content makes the final cut. Too strict a filter? Users miss important stories. Too loose? Content quality plummets.

Feedback Loop: User engagement signals (clicks, time spent, shares) shape what the agent learns. But they reward clickbait and outrage—not truth.

What Makes This Hard

Conflicting objectives: Engagement vs. accuracy, speed vs. quality, personalization vs. breadth
Source credibility: How do you teach an agent to recognize trustworthy sources?
Recency bias: Breaking news gets high engagement, but older analysis is sometimes more valuable
Filter bubbles: Optimizing for user preference can trap people in narrow information silos

Content Agent Architecture

Core Components

1. Content Ingestion & Parsing

The agent consumes articles, blog posts, videos, or structured data from multiple sources. It extracts metadata: author, publication date, source credibility score, topic tags, and raw content.

Key question: How do you know if a source is trustworthy? (By reputation, fact-checking records, editorial standards, peer review? All are incomplete.)

2. User Context & Intent Modeling

The agent builds a model of what each user cares about. This comes from explicit signals (preferences, searches, saved items) and implicit signals (reading time, engagement patterns, topic overlap).

Key question: When should the agent expose users to content outside their preferences? (Serendipity vs. echo chambers)

3. Relevance Scoring

Given a piece of content and a user, the agent assigns a relevance score. This combines textual similarity, topic alignment, semantic matching, and learned preferences.

Key question: How do you prevent the algorithm from reinforcing what users already think?

4. Quality & Credibility Filtering

The agent filters content by credibility (fact-check status, source reputation), freshness (publish date, topic recency), and quality (writing clarity, evidence density, expert authorship).

Key question: Who decides what counts as "high quality"? And can those standards change per user?

5. Ranking & Diversity

The agent ranks surviving content by relevance, but also ensures diversity: multiple perspectives on the same topic, mix of news types (breaking vs. analysis vs. opinion), and sources of varying credibility levels.

Key question: Should the agent show users content that challenges them?

Common Failure Modes

Engagement Trap

The agent optimizes for user clicks and time-on-site, which rewards sensationalism, outrage, and conspiracy content over factual news. Engagement metrics and truth diverge.

Source Bias

The agent learns that certain sources (e.g., major newspapers) are "safer" to recommend, while niche outlets (specialist publications, independent journalists) get downranked despite high quality. Distribution becomes centralized.

Echo Chamber

By personalizing content to each user's history, the agent creates information silos. Users see more of what they already like and less of what they don't, narrowing their worldview.

Design Principles for Content Agents

1. Explicit Diversity Goals

Don't just optimize for relevance. Explicitly require diversity of sources, perspectives, and content types. Make it a hard constraint, not a soft bonus.

2. Separate Evaluation from Engagement

Train content evaluation on explicit quality metrics (fact-checks, expert ratings, citations) separate from user engagement. Don't let clicks define quality.

3. Credibility Without Censorship

Source credibility should inform ranking, not exclusion. Low-credibility sources can appear, but lower. Users can still choose to engage.

4. User Control

Transparency: show users why content was ranked this way. Autonomy: let users adjust filters and preferences. Trust follows.

Reference & Resources

Key Concepts

Retrieval-Augmented Generation (RAG): Content agents use retrieval to find relevant sources, then use LLMs to summarize or synthesize.
Multi-Objective Optimization: Balancing multiple goals (relevance, credibility, diversity, engagement) requires explicit trade-off decisions.
Feedback Loops: User engagement signals train future rankings, but can reinforce bias if not carefully designed.
Source Attribution: Content agents must transparently cite where recommendations come from.
Recency vs. Depth: Breaking news is timely but shallow; analysis takes time but adds perspective.

Evaluation Checklist

When building a content agent, ask:

How does the agent decide what counts as "credible"?
What signals drive ranking? (Engagement? Expert rating? Source reputation?)
How often is user feedback incorporated? Can the agent adapt per-user?
What sources are excluded? Why? Who decided?
Does diversity happen organically or by explicit design?
How transparent is the agent? Can users understand why they see what they see?

Real-World Examples

Google News: Combines crawled articles, ranking by domain authority and freshness, with some personalization.
Twitter/X Explore Tab: Surfaces trending topics and trending tweets, balancing virality with source credibility.
Reddit's r/all: Community voting drives ranking; explicit rules prevent spam and misinformation.
Spotify Discover Weekly: Blends user listening history (personalization) with global trending (discovery).
Recommendation.com systems: Netflix, Amazon, YouTube all face the engagement vs. quality trade-off.

Build Your Content Agent

Content Agent Design Lab

Design a content agent for one of these scenarios. The AI coach will guide your design and challenge your assumptions.

Your task: Design a content agent for ONE of these scenarios. You'll specify the architecture, evaluation criteria, and failure mitigation strategy. The AI coach will ask clarifying questions and pressure-test your design.

Choose Your Scenario

Scenario A: Academic Research Aggregator
Your agent surfaces recent papers for researchers. It must rank by relevance to their field, but also surface seminal older papers and cross-disciplinary work that's often missed. How do you balance freshness with foundational knowledge?

Scenario B: Healthcare Information Portal
Your agent helps patients understand conditions and treatments. It must filter out junk medical advice, but also show multiple legitimate perspectives (surgery vs. medicine vs. lifestyle). How do you prevent the agent from overwhelming patients with conflicting information?

Scenario C: Job Listing Curator
Your agent shows job seekers opportunities matching their skills and interests. But it should also surface stretch roles and overlooked employers. How do you avoid showing the same safe companies to everyone?

Pick one above, or describe your own scenario. Then talk through your agent design with the coach.

Lab Guidelines

Be specific: Don't say "the agent ranks by relevance"—explain what relevance means in your scenario.
Address trade-offs: Every design choice sacrifices something. Name the trade-offs.
Assume scale: Your agent will process millions of items. Design for efficiency.
Plan for failure: What happens when the agent gets it wrong? How does it recover?
Consider users: Who is your agent serving? What are they trying to accomplish?