17 KiB

Raw Blame History

Story 4.4: Custom AI Models & Enhanced Markdown Export

Story Overview

Story ID: 4.4
Epic: 4 - Advanced Intelligence & Developer Platform
Title: Custom AI Models & Enhanced Markdown Export
Status: 📋 READY FOR IMPLEMENTATION
Priority: High

Goal: Implement custom AI model configurations with enhanced markdown export featuring executive summaries, timestamped sections with clickable navigation, and professional formatting.

Value Proposition: Provide professional export format with executive-level insights and improved navigation, while offering custom prompt templates and model parameter configurations for domain-specific summarization needs.

Dependencies:

✅ Story 4.2 (API Endpoints & Developer SDK) - Complete
✅ Existing export system foundation
✅ AI service infrastructure

Estimated Effort: 32 hours (enhanced with export features)

Technical Requirements

Core Features

1. Executive Summary Generation

2-3 Paragraph Overview: Concise executive summary at top of all exports
Key Metrics: Video duration, word count, main topics, sentiment analysis
Decision-Maker Focus: Business value, ROI implications, action items
Executive Language: Professional tone suitable for leadership consumption

2. Timestamped Sections Enhancement

Format: [HH:MM:SS] Section Title with clickable navigation
Semantic Segmentation: Intelligent topic-based section detection
Jump-to-Video: Links that open YouTube video at specific timestamps
Section Summaries: Brief summary for each timestamped section
Progress Indicators: Visual progress through video content

3. Enhanced Markdown Structure

Table of Contents: Auto-generated with timestamp links
Hierarchical Sections: Nested structure following video content flow
Improved Formatting: Professional typography with consistent styling
Metadata Header: Video info, analysis date, processing details
Footer: Analysis metadata and quality indicators

4. Custom Prompt Template Management

Template Library: Predefined templates for different use cases
Template Editor: Web interface for creating custom prompt templates
Parameter Configuration: Temperature, token limits, model selection
Template Versioning: Track changes and performance metrics
Sharing System: Public/private template sharing

5. Domain-Specific Presets

Educational: Focus on learning objectives, key concepts, exercises
Business: Emphasize ROI, market implications, strategic insights
Technical: Highlight implementation details, tools, architecture
Content Creation: Analyze engagement patterns, audience insights
Research: Academic focus with citations and methodology

6. A/B Testing Framework

Prompt Optimization: Test different prompt variations
Model Comparison: Compare outputs across different AI models
Quality Metrics: Automated quality scoring and user feedback
Performance Analytics: Processing time, cost, accuracy tracking
Statistical Analysis: Confidence intervals and significance testing

Technical Architecture

Enhanced Export Pipeline

class EnhancedExportService:
    def __init__(self):
        self.executive_generator = ExecutiveSummaryGenerator()
        self.timestamp_processor = TimestampProcessor()  
        self.markdown_formatter = MarkdownFormatter()
        self.template_manager = TemplateManager()
    
    async def generate_enhanced_export(
        self, 
        summary_id: str, 
        template_id: Optional[str] = None,
        export_config: ExportConfig = None
    ) -> EnhancedMarkdownExport:
        # Generate executive summary
        # Process timestamps and create navigation
        # Apply custom formatting and template
        # Return structured export with metadata

Database Schema Extensions

-- Custom prompt templates
CREATE TABLE prompt_templates (
    id UUID PRIMARY KEY,
    user_id UUID REFERENCES users(id),
    name VARCHAR(200),
    description TEXT,
    prompt_text TEXT,
    domain_category VARCHAR(50), -- 'educational', 'business', 'technical', etc.
    model_config JSONB, -- temperature, max_tokens, etc.
    is_public BOOLEAN DEFAULT FALSE,
    usage_count INTEGER DEFAULT 0,
    rating FLOAT DEFAULT 0.0,
    created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
    updated_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP
);

-- A/B testing experiments  
CREATE TABLE prompt_experiments (
    id UUID PRIMARY KEY,
    name VARCHAR(200),
    description TEXT,
    baseline_template_id UUID REFERENCES prompt_templates(id),
    variant_template_id UUID REFERENCES prompt_templates(id),
    status VARCHAR(20) DEFAULT 'active', -- 'active', 'completed', 'paused'
    success_metric VARCHAR(50), -- 'quality_score', 'user_rating', 'processing_time'
    statistical_significance FLOAT,
    created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP
);

-- Enhanced export metadata
CREATE TABLE export_metadata (
    id UUID PRIMARY KEY,
    summary_id UUID REFERENCES summaries(id),
    template_id UUID REFERENCES prompt_templates(id),
    export_type VARCHAR(20), -- 'markdown', 'pdf', 'json'
    executive_summary TEXT,
    section_count INTEGER,
    timestamp_count INTEGER,
    processing_time_seconds FLOAT,
    quality_score FLOAT,
    created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP
);

-- Section detection and timestamps
CREATE TABLE summary_sections (
    id UUID PRIMARY KEY,
    summary_id UUID REFERENCES summaries(id),
    section_index INTEGER,
    title VARCHAR(300),
    start_timestamp INTEGER, -- seconds
    end_timestamp INTEGER,
    content TEXT,
    summary TEXT,
    key_points JSONB,
    created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP
);

Implementation Tasks

Task 4.4.1: Executive Summary Generation (8 hours)

Subtasks:

Executive Summary Generator (4 hours)
- Create ExecutiveSummaryGenerator service class
- Implement professional executive summary prompts
- Extract key metrics (duration, topics, sentiment)
- Generate business value and action items
- Test summary quality and executive readability
Metadata Collection Service (2 hours)
- Extract video metadata (duration, views, publish date)
- Calculate transcript statistics (word count, reading time)
- Perform sentiment analysis on content
- Generate quality and confidence scores
- Create metadata header for exports
Executive Template System (2 hours)
- Create executive summary template variations
- Implement length controls (brief, standard, detailed)
- Add business context detection and emphasis
- Create executive language style guide
- Test with various video content types

Subtasks:

Semantic Section Detection (5 hours)
- Implement topic segmentation algorithm using transcript
- Identify natural section breaks in video content
- Extract meaningful section titles from content
- Handle various video formats (lectures, tutorials, discussions)
- Validate section quality and coherence
Timestamp Processing Engine (4 hours)
- Convert transcript timestamps to [HH:MM:SS] format
- Generate clickable YouTube links with timestamp parameters
- Create section navigation structure
- Implement deep-linking to video positions
- Add timestamp validation and error handling
Table of Contents Generator (3 hours)
- Auto-generate markdown table of contents
- Create hierarchical section structure
- Add timestamp links for each section
- Implement progress indicators and visual elements
- Test navigation functionality across different markdown renderers

Task 4.4.3: Enhanced Markdown Formatting (6 hours)

Subtasks:

Professional Markdown Formatter (3 hours)
- Create enhanced markdown template system
- Implement consistent typography and styling
- Add metadata headers and footers
- Create professional document structure
- Test with various markdown parsers and renderers
Section Content Enhancement (2 hours)
- Generate brief summaries for each timestamped section
- Add key points and takeaways per section
- Implement content hierarchy and flow
- Create section transitions and connections
- Validate content quality and readability
Export Quality Control (1 hour)
- Implement markdown validation and quality checks
- Add automated formatting consistency verification
- Create export preview and validation system
- Test with different export destinations (GitHub, Notion, etc.)
- Add quality scoring for generated exports

Task 4.4.4: Custom Prompt Template System (6 hours)

Subtasks:

Template Management Backend (3 hours)
- Create TemplateManager service with CRUD operations
- Implement template versioning and history tracking
- Add template validation and security checks
- Create template sharing and permissions system
- Build template performance analytics
Template Editor Frontend (2 hours)
- Create template creation/editing interface
- Add preview functionality for prompt testing
- Implement parameter configuration (temperature, tokens)
- Create template library browser with categories
- Add template rating and feedback system
Domain-Specific Presets (1 hour)
- Create predefined templates for each domain
- Educational preset: learning objectives, key concepts
- Business preset: ROI analysis, strategic insights
- Technical preset: implementation details, architecture
- Test preset effectiveness across different content types

Data Models

Enhanced Export Models

from pydantic import BaseModel
from typing import List, Dict, Optional, Any
from datetime import datetime
from enum import Enum

class ExportFormat(str, Enum):
    MARKDOWN = "markdown"
    PDF = "pdf"
    JSON = "json"
    HTML = "html"

class DomainCategory(str, Enum):
    EDUCATIONAL = "educational"
    BUSINESS = "business"
    TECHNICAL = "technical"
    CONTENT_CREATION = "content_creation"
    RESEARCH = "research"
    GENERAL = "general"

class ExecutiveSummary(BaseModel):
    overview: str
    key_metrics: Dict[str, Any]
    main_topics: List[str]
    business_value: Optional[str] = None
    action_items: List[str]
    sentiment_analysis: Dict[str, float]

class TimestampedSection(BaseModel):
    index: int
    title: str
    start_timestamp: int  # seconds
    end_timestamp: int
    youtube_link: str
    content: str
    summary: str
    key_points: List[str]

class PromptTemplate(BaseModel):
    id: str
    name: str
    description: str
    prompt_text: str
    domain_category: DomainCategory
    model_config: Dict[str, Any]
    is_public: bool
    usage_count: int
    rating: float
    created_at: datetime
    updated_at: datetime

class ExportConfig(BaseModel):
    format: ExportFormat
    include_executive_summary: bool = True
    include_timestamps: bool = True
    include_toc: bool = True
    section_detail_level: str = "standard"  # brief, standard, detailed
    custom_template_id: Optional[str] = None

class EnhancedMarkdownExport(BaseModel):
    summary_id: str
    executive_summary: ExecutiveSummary
    table_of_contents: List[str]
    sections: List[TimestampedSection]
    markdown_content: str
    metadata: Dict[str, Any]
    quality_score: float
    processing_time_seconds: float
    template_used: Optional[PromptTemplate] = None

Testing Strategy

Unit Tests

Executive Summary Tests: Quality and business value extraction
Timestamp Processing Tests: Format validation and link generation
Section Detection Tests: Topic segmentation accuracy
Template System Tests: CRUD operations and validation
Markdown Formatter Tests: Output quality and consistency

Integration Tests

Export Pipeline Tests: End-to-end enhanced export generation
Template Usage Tests: Custom template application to summaries
API Integration Tests: All new export and template endpoints
Database Tests: Template storage and retrieval operations

Quality Assurance Tests

Executive Summary Quality: Business relevance and actionability
Navigation Functionality: Timestamp links and table of contents
Template Effectiveness: Domain-specific preset performance
Export Consistency: Output quality across different content types

API Specification

Enhanced Export Endpoints

/api/export/{summary_id}/enhanced:
  post:
    summary: Generate enhanced markdown export with executive summary
    parameters:
      - name: summary_id
        in: path
        required: true
        schema:
          type: string
    requestBody:
      required: true
      content:
        application/json:
          schema:
            $ref: '#/components/schemas/ExportConfig'
    responses:
      200:
        content:
          application/json:
            schema:
              $ref: '#/components/schemas/EnhancedMarkdownExport'

/api/templates:
  get:
    summary: List available prompt templates
    parameters:
      - name: domain
        in: query
        schema:
          type: string
          enum: [educational, business, technical, content_creation, research]
      - name: public_only
        in: query
        schema:
          type: boolean
          default: false
    responses:
      200:
        content:
          application/json:
            schema:
              type: array
              items:
                $ref: '#/components/schemas/PromptTemplate'
  post:
    summary: Create new prompt template
    requestBody:
      required: true
      content:
        application/json:
          schema:
            $ref: '#/components/schemas/PromptTemplate'
    responses:
      201:
        content:
          application/json:
            schema:
              $ref: '#/components/schemas/PromptTemplate'

Success Criteria

Functional Requirements ✅

Executive summary generation provides 2-3 paragraph professional overview
Timestamped sections use [HH:MM:SS] format with clickable YouTube links
Table of contents auto-generates with hierarchical navigation structure
Custom prompt templates support creation, editing, and sharing
Domain-specific presets available for 5 different categories
Enhanced markdown exports maintain professional formatting and consistency

Quality Requirements ✅

Executive summaries focus on business value and actionable insights
Section detection creates logical content segments (accuracy >85%)
Timestamp links correctly jump to specific video positions
Custom templates produce measurably different output characteristics
Export formatting renders correctly across major markdown platforms
Template performance tracking shows usage analytics and effectiveness

Performance Requirements ✅

Executive summary generation completes in under 15 seconds
Enhanced export processing under 30 seconds for standard videos
Template application adds less than 10 seconds to summary generation
Section detection handles videos up to 3 hours in length
Template editor provides real-time preview with <2 second latency

Implementation Notes

Executive Summary Best Practices

Focus on business value and strategic implications
Include quantitative metrics where possible (duration, key statistics)
Provide clear action items and next steps
Use executive language appropriate for decision-makers
Keep length between 150-300 words for optimal readability

Timestamp Processing Guidelines

Detect natural section breaks using topic modeling
Create meaningful section titles that summarize content
Ensure timestamp accuracy within 5-second tolerance
Generate YouTube deep links with proper timestamp parameters
Handle edge cases (missing timestamps, very short sections)

Template System Design

Implement template versioning for tracking changes
Add validation to prevent malicious prompt injection
Create template effectiveness scoring based on user feedback
Support template inheritance and composition
Enable collaborative template development

Markdown Enhancement Patterns

Use consistent heading hierarchy (H1 for title, H2 for sections)
Add metadata blocks at beginning and end of documents
Implement responsive table of contents for different viewers
Create professional typography with proper spacing and formatting
Test compatibility with GitHub, Notion, Obsidian, and other platforms

Risk Mitigation

High Risk: Executive Summary Quality

Risk: Generic summaries that don't provide executive value
Mitigation: Business-focused prompts, quality scoring, executive review feedback

Medium Risk: Section Detection Accuracy

Risk: Poor section breaks or meaningless titles
Mitigation: Topic modeling validation, manual override options, quality thresholds

Medium Risk: Template Complexity

Risk: Template system too complex for users to adopt
Mitigation: Simple editor interface, comprehensive presets, guided template creation

Story Owner: Development Team
Architecture Reference: BMad Method Epic-Story Structure
Implementation Status: Ready for Development
Last Updated: 2025-08-27

17 KiB Raw Blame History

Story 4.4: Custom AI Models & Enhanced Markdown Export

Story Overview

Technical Requirements

Core Features

1. Executive Summary Generation

2. Timestamped Sections Enhancement

3. Enhanced Markdown Structure

4. Custom Prompt Template Management

5. Domain-Specific Presets

6. A/B Testing Framework

Technical Architecture

Enhanced Export Pipeline

Database Schema Extensions

Implementation Tasks

Task 4.4.1: Executive Summary Generation (8 hours)

Subtasks:

Task 4.4.2: Timestamped Sections & Navigation (12 hours)

Subtasks:

Task 4.4.3: Enhanced Markdown Formatting (6 hours)

Subtasks:

Task 4.4.4: Custom Prompt Template System (6 hours)

Subtasks:

Data Models

Enhanced Export Models

Testing Strategy

Unit Tests

Integration Tests

Quality Assurance Tests

API Specification

Enhanced Export Endpoints

Success Criteria

Functional Requirements ✅

Quality Requirements ✅

Performance Requirements ✅

Implementation Notes

Executive Summary Best Practices

Timestamp Processing Guidelines

Template System Design

Markdown Enhancement Patterns

Risk Mitigation

High Risk: Executive Summary Quality

Medium Risk: Section Detection Accuracy

Medium Risk: Template Complexity

17 KiB

Raw Blame History