advanced

Subagent Blueprint: Finding What Nobody Is Talking About

This advanced pattern combines parallel sub-agents, structured output formats, and gap analysis to discover underreported trends and opportunities. It's a production-ready blueprint for automated rese

Subagent Blueprint: Finding What Nobody Is Talking About

Overview

This advanced pattern combines parallel sub-agents, structured output formats, and gap analysis to discover underreported trends and opportunities. It's a production-ready blueprint for automated research systems that go beyond surface-level summarization.

The Problem with Traditional Research

What Most AI Research Does

Typical approach:

  1. Search for topic
  2. Summarize top results
  3. Present summary

Result: Regurgitated content everyone already knows

What This Blueprint Does

Advanced approach:

  1. Parallel sub-agents search multiple sources simultaneously
  2. Structured output ensures consistent, actionable findings
  3. Pattern recognition identifies themes across sources
  4. Gap analysis finds what nobody is talking about yet
  5. Synthesis creates unique insights, not summaries

Result: Original intelligence that provides competitive advantage

Architecture

The Orchestrator Pattern

┌─────────────────────────────────────┐
│   Main Agent (Orchestrator)         │
│   - Plans research strategy          │
│   - Spawns parallel sub-agents      │
│   - Synthesizes findings            │
│   - Identifies gaps and patterns    │
└──────────────┬──────────────────────┘
               │
       ┌───────┴───────┬───────────┬──────────┐
       │               │           │          │
┌──────▼──────┐ ┌─────▼──────┐ ┌──▼─────┐ ┌──▼─────┐
│ Sub-Agent 1 │ │ Sub-Agent 2│ │ Sub-A 3│ │ Sub-A 4│
│ Twitter/X   │ │ News Sites │ │ Reddit │ │ HN     │
│ Search      │ │ Scraping   │ │ Trends │ │ Posts  │
└─────────────┘ └────────────┘ └────────┘ └────────┘
       │               │           │          │
       └───────┬───────┴───────────┴──────────┘
               │
       ┌───────▼────────────────────────────┐
       │   Structured Output Collection     │
       │   - Key findings                   │
       │   - Notable opinions               │
       │   - Source links                   │
       │   - Patterns                       │
       │   - Gaps                           │
       └────────────────────────────────────┘

Why This Works

Parallel execution:

  • 4 sources searched simultaneously
  • 4x faster than sequential
  • More comprehensive coverage

Structured output:

  • Consistent format from each sub-agent
  • Easy to synthesize
  • Actionable insights

Gap analysis:

  • Identifies underreported angles
  • Finds emerging trends early
  • Creates unique value

Implementation

Step 1: Define Structured Output Format

Each sub-agent must return:

## Key Findings
- [Finding 1 with context]
- [Finding 2 with context]
- [Finding 3 with context]

## Notable Opinions
- [Opinion 1] - [Source/Author]
- [Opinion 2] - [Source/Author]

## Source Links
- [Title 1](URL) - Brief description
- [Title 2](URL) - Brief description

## Patterns Observed
- [Pattern 1 across multiple sources]
- [Pattern 2 across multiple sources]

## Gaps Identified
- [What nobody is discussing yet]
- [Underreported angle]

Step 2: Create Research Skill

Research Blueprint Architecture

---
name: parallel_research
description: Conducts parallel research across multiple sources with gap analysis
---

# Parallel Research Skill

## Orchestration Strategy

You are the orchestrator and planner. Your job:
1. Plan the research operation
2. Spawn parallel sub-agents
3. Collect structured outputs
4. Synthesize findings
5. Identify gaps and opportunities

## Sub-Agent Deployment

### Sub-Agent 1: Twitter/X Intelligence
**Task:** Search Twitter/X for discussions about [topic]
**Sources:** 
- Monitored accounts: @account1, @account2, @account3
- Hashtags: #relevant, #tags
- Time range: Last 24 hours

**Output format:** Use structured output template

### Sub-Agent 2: News Aggregation
**Task:** Search news sites for [topic] coverage
**Sources:**
- TechCrunch, The Verge, Ars Technica
- Industry-specific publications
- Time range: Last 7 days

**Output format:** Use structured output template

### Sub-Agent 3: Reddit Analysis
**Task:** Search relevant subreddits for [topic] discussions
**Sources:**
- r/MachineLearning, r/LocalLLaMA, r/OpenAI
- Sort by: Hot, Top (week)

**Output format:** Use structured output template

### Sub-Agent 4: Hacker News
**Task:** Search Hacker News for [topic] submissions and comments
**Sources:**
- Front page
- Recent submissions
- Top comments

**Output format:** Use structured output template

## Synthesis Phase

After all sub-agents report back, create structured document:

### Executive Summary
- 3-5 sentence overview
- Most important finding
- Key recommendation

### Key Themes and Patterns
- Theme 1: [Description]
  - Mentioned by: [Sources]
  - Significance: [Why it matters]
  
- Theme 2: [Description]
  - Mentioned by: [Sources]
  - Significance: [Why it matters]

### Opportunities and Gaps
- Gap 1: [What nobody is covering]
  - Why it matters: [Explanation]
  - Opportunity: [How to capitalize]
  
- Gap 2: [Underreported angle]
  - Why it matters: [Explanation]
  - Opportunity: [How to capitalize]

### All Source Links
Organized by category with annotations

## Quality Checks

Before delivering:
- [ ] All sub-agents completed successfully
- [ ] At least 3 gaps identified
- [ ] All sources linked and verified
- [ ] Patterns identified across multiple sources
- [ ] Executive summary is actionable

Step 3: Automation with Cron Jobs

Frequency consideration:

*Hourly (0 * * * ):

  • ❌ Too frequent for most sources
  • Many feeds show "no items in last hour"
  • Sparse results, wasted API calls

**Every 6 hours (0 /6 * * ):

  • ✅ Optimal for news/trend monitoring
  • Balances freshness with content density
  • Enough new content to analyze

*Daily (0 8 * * ):

  • ✅ Good for comprehensive reports
  • Full day of content to analyze
  • Better for pattern identification

Recommendation: Start with 6-hour intervals, adjust based on data availability

Step 4: Cron Job Setup

Create a cron job that runs every 6 hours:
0 */6 * * *

Task: Execute skill parallel_research with topic "AI developments"

Deliverables:
- Structured research document
- Save to memory/research/YYYY-MM-DD-HH.md
- Send summary to Discord #research channel

Advanced Patterns

Pattern 1: Trend Discovery

Goal: Identify emerging trends before they go mainstream

Strategy:

## Sub-Agent Focus Areas

1. **Early Signals Agent**
   - GitHub trending repositories
   - Academic paper preprints (arXiv)
   - Small but growing subreddits

2. **Mainstream Agent**
   - Major news outlets
   - Popular Twitter accounts
   - Front page of HN

3. **Gap Analysis**
   - Compare early signals vs. mainstream
   - Identify: What's trending in early channels but not mainstream yet?
   - Opportunity: Cover these topics before everyone else

Pattern 2: Competitive Intelligence

Goal: Track competitor activity and market positioning

Strategy:

## Sub-Agent Focus Areas

1. **Product Updates Agent**
   - Competitor blogs
   - Product hunt launches
   - GitHub releases

2. **Pricing Agent**
   - Pricing page changes (via web scraping)
   - Promotional campaigns
   - Discount patterns

3. **Sentiment Agent**
   - Customer reviews
   - Social media mentions
   - Support forum discussions

4. **Gap Analysis**
   - Features competitors lack
   - Underserved customer segments
   - Pricing opportunities

Pattern 3: Content Opportunity Finder

Goal: Find content angles with high demand, low supply

Strategy:

## Sub-Agent Focus Areas

1. **Question Mining Agent**
   - Reddit questions
   - Stack Overflow
   - Quora

2. **Content Supply Agent**
   - Existing blog posts
   - YouTube videos
   - Documentation

3. **Gap Analysis**
   - High-demand questions with few quality answers
   - Topics with outdated content
   - Underserved niches

Structured Output Best Practices

1. Enforce Format Consistency

In sub-agent instructions:

CRITICAL: You MUST use this exact format:

## Key Findings
- [Finding with context]

## Notable Opinions
- [Opinion] - [Source]

## Source Links
- [Title](URL) - Description

## Patterns Observed
- [Pattern across sources]

## Gaps Identified
- [What nobody is discussing]

Do not deviate from this format.

2. Validate Sub-Agent Output

Orchestrator checks:

## Output Validation

For each sub-agent response:
1. Verify all sections present
2. Check minimum content:
   - At least 3 key findings
   - At least 2 notable opinions
   - At least 5 source links
   - At least 2 patterns
   - At least 1 gap
3. Validate source links are accessible
4. Ensure no placeholder text

If validation fails: Request sub-agent retry with specific feedback

3. Handle Failures Gracefully

Failure scenarios:

Sub-agent timeout:

If Sub-Agent 2 times out:
- Continue with other sub-agents
- Note missing data source in final report
- Attempt retry in background

API rate limit:

If Twitter API rate limited:
- Use cached data if available
- Note limitation in report
- Schedule retry for next cycle

No results found:

If Sub-Agent 3 finds no relevant content:
- Document as "no activity in this channel"
- May indicate emerging trend (too early for Reddit)
- Include in gap analysis

Synthesis Techniques

Cross-Source Pattern Recognition

Technique:

## Pattern Identification Algorithm

1. Extract key terms from each sub-agent
2. Count frequency across sources
3. Identify clusters:
   - High frequency = mainstream topic
   - Medium frequency = emerging trend
   - Low frequency = early signal or noise

4. Analyze sentiment:
   - Positive across sources = consensus
   - Mixed sentiment = controversy
   - Negative = potential risk

5. Track over time:
   - Increasing mentions = growing trend
   - Decreasing mentions = fading interest
   - Stable mentions = established topic

Gap Analysis Framework

Framework:

## Gap Types

### Type 1: Underreported Developments
- Significant event with minimal coverage
- Opportunity: Be first to cover comprehensively

### Type 2: Missing Perspectives
- Topic covered, but only from one angle
- Opportunity: Provide alternative viewpoint

### Type 3: Unanswered Questions
- Community asking questions, no good answers
- Opportunity: Create definitive resource

### Type 4: Emerging Trends
- Early signals, not yet mainstream
- Opportunity: Position as thought leader

### Type 5: Contradictions
- Sources disagree, no resolution
- Opportunity: Investigate and clarify

Actionable Insights Generation

Transform findings into actions:

Bad synthesis:

GPT-5 was announced. Many people are discussing it.

Good synthesis:

GPT-5 announcement generating 3x normal discussion volume.

Key insight: Focus is on reasoning capabilities, not speed.

Opportunity: Create content comparing reasoning benchmarks 
across models. Gap: Nobody has done comprehensive comparison yet.

Action: Produce "GPT-5 vs Claude Opus 4.7 Reasoning Benchmark" 
within 48 hours to capture search traffic.

Automation Strategy

Frequency Optimization

Data-driven approach:

Week 1: Test multiple frequencies

Hourly:   0 * * * *
6-hour:   0 */6 * * *
Daily:    0 8 * * *

Week 2: Analyze results

Hourly:   30% of runs found <5 new items (too frequent)
6-hour:   85% of runs found 15-30 new items (optimal)
Daily:    100% found 50+ items (good for comprehensive reports)

Week 3: Optimize

Keep: 6-hour for real-time monitoring
Keep: Daily for comprehensive reports
Remove: Hourly (too sparse)

Notification Strategy

Tiered notifications:

Critical findings (immediate):

- Major competitor announcement
- Significant market shift
- Breaking news in your domain
→ Send to Discord/Slack immediately

Important findings (digest):

- Emerging trends
- Notable opinions
- Pattern changes
→ Include in next scheduled report

Routine findings (archive):

- Expected updates
- Minor news
- Background information
→ Save to memory, no notification

Real-World Example

Use Case: AI News Monitoring

Goal: Stay ahead of AI developments for content creation

Setup:

Cron job:

0 */6 * * * Execute skill: ai_research_pipeline

Sub-agents:

  1. Twitter: Monitor @OpenAI, @AnthropicAI, @GoogleAI, etc.
  2. News: TechCrunch, VentureBeat, The Verge
  3. Reddit: r/MachineLearning, r/LocalLLaMA
  4. HN: AI-related submissions

Output:

# AI Research Report - 2026-05-06 12:00

## Executive Summary
GPT-5 reasoning capabilities generating significant discussion.
Key gap: No comprehensive benchmark comparison exists yet.
Opportunity: Create definitive comparison within 48 hours.

## Key Themes
1. **Reasoning Capabilities** (mentioned 47 times)
   - GPT-5 claims 95% on complex reasoning tasks
   - Community skeptical, wants independent verification
   - Gap: No third-party benchmarks published yet

2. **Cost Concerns** (mentioned 23 times)
   - GPT-5 pricing not announced
   - Speculation: 2-3x GPT-4 pricing
   - Gap: No cost-benefit analysis available

## Opportunities
1. **Benchmark Comparison** (HIGH PRIORITY)
   - Nobody has published GPT-5 vs Claude Opus 4.7 reasoning tests
   - High search demand expected
   - Action: Produce within 48 hours

2. **Cost Analysis** (MEDIUM PRIORITY)
   - Once pricing announced, immediate demand for ROI analysis
   - Action: Prepare framework now, publish when pricing drops

## All Sources (67 total)
[Organized list with annotations]

Result: Actionable intelligence delivered every 6 hours

Troubleshooting

"Sub-agents returning inconsistent formats"

Cause: Instructions not explicit enough

Solution:

Add to sub-agent prompt:
"CRITICAL: Use EXACTLY this format. Do not add extra sections.
Do not skip sections. Do not reorder sections."

Include example output in prompt.

"Gap analysis is generic"

Cause: Not comparing across sources effectively

Solution:

Add explicit gap analysis instructions:
"Compare findings across all sub-agents. Identify:
1. What Source A mentioned but Sources B, C, D did not
2. Questions asked but not answered
3. Contradictions between sources
4. Topics with high engagement but low content"

"Too much data, synthesis is overwhelming"

Cause: Sub-agents returning too much information

Solution:

Add filtering criteria:
"Return only findings that meet these criteria:
- Mentioned by multiple sources (pattern)
- High engagement (>100 upvotes/likes)
- Published within last 24 hours (fresh)
- Relevant to [specific focus area]"

Best Practices

  1. Start with 2-3 sub-agents - Scale up after validating pattern
  2. Enforce structured output - Makes synthesis 10x easier
  3. Focus on gaps - That's where unique value comes from
  4. Automate frequency based on data - Don't guess, measure
  5. Iterate on synthesis - Refine prompts based on output quality
  6. Archive everything - Historical data reveals long-term trends

Related Resources


Duration: 1 minute
Difficulty: Advanced
Video Reference: This AI Subagent Blueprint

Tags

advanced discord
Back to Guides