Files
net/docs/INVESTOR_BRIEF.md
cawcenter 4c632b6229 📚 Comprehensive Documentation Suite (Mil-Spec/IBM Redbook Style)
INVESTOR DOCUMENTATION:
- INVESTOR_BRIEF.md: Executive summary, capacity metrics, architecture overview
- PLATFORM_CAPABILITIES.md: 5 modules, 27 subcomponents catalog
- TECHNICAL_ARCHITECTURE.md: Mermaid diagrams, data flows, extension points

CTO/SENIOR DEV ONBOARDING:
- CTO_ONBOARDING.md: System overview, security model, operational runbook
- DEVELOPER_GUIDE.md: Clone→Run→Deploy workflow, debugging
- API_REFERENCE.md: 30+ endpoints with request/response specs
- DATABASE_SCHEMA.md: 30+ tables in Harris Matrix order
- COMPONENT_LIBRARY.md: 182 React components catalog

REFERENCE:
- GLOSSARY.md: 70+ platform terms defined
- ADMIN_PAGES_GUIDE.md: 66 admin pages by module

Style: BLUF format, zero marketing fluff, high-density tables
2025-12-14 13:58:28 -05:00

7.9 KiB
Raw Permalink Blame History

INVESTOR BRIEF: Spark Platform

BLUF: Spark is a multi-tenant content scaling platform that generates location-targeted SEO articles using spintax permutation and Cartesian pattern matching. Current capacity: 50,000+ unique articles per campaign configuration.


1. Platform Function

Spark automates SEO content production at scale. The system:

  1. Ingests buyer personas (Avatars), location data, and content templates
  2. Computes Cartesian products of location × persona × offer variations
  3. Generates unique articles with geo-specific and persona-specific content
  4. Manages multi-site content distribution with scheduling controls

Core Problem Solved: Manual SEO content creation produces ~2-5 articles/day. Spark produces 100-500 articles/hour with equivalent uniqueness and targeting precision.


2. Technical Stack

Layer Technology Version Purpose
Frontend Astro 4.7 SSR + Islands Architecture
UI React 18.3 Interactive components
Backend Directus 11 Headless CMS + REST/GraphQL
Database PostgreSQL 16 Primary data store
Extensions PostGIS 3.4 Geographic queries
Cache Redis 7 Session + job queue backing
Queue BullMQ - Async job processing
Deployment Coolify - Docker orchestration

Infrastructure: Self-hostable via Docker Compose. No external SaaS dependencies.


3. Data Assets

3.1 Location Database

Dataset Count Source
US States 51 Census data
US Counties 3,143 Census data
US Cities ~50 per county Population-ranked

3.2 Intelligence Assets

Asset Type Description IP Value
Avatar Intelligence 10+ buyer personas with psychographics Proprietary
Wealth Clusters 5+ economic profile groupings Proprietary
Spintax Dictionaries Word/phrase variation libraries Proprietary
Cartesian Patterns Title/hook formula combinations Proprietary
Offer Blocks 10+ promotional templates Proprietary

4. Capacity Metrics

4.1 Theoretical Maximum

10 Avatars × 10 Niches × 50 Cities × 10 Offers = 50,000 unique articles

4.2 Practical Campaign

2 Avatars × 3 Niches × 10 Cities × 2 Offers = 120 articles

4.3 Generation Speed

Operation Throughput
Headline permutation 1,000/second
Article assembly 100-500/hour
Queue processing Configurable batch size

5. Architecture Overview

┌─────────────────────────────────────────────────────────────────┐
│                        USER REQUEST                              │
└──────────────────────────┬──────────────────────────────────────┘
                           ▼
┌─────────────────────────────────────────────────────────────────┐
│  TRAEFIK (Reverse Proxy)                                        │
│  Routes by domain/path to containers                            │
└──────────────────────────┬──────────────────────────────────────┘
                           ▼
        ┌──────────────────┴──────────────────┐
        ▼                                     ▼
┌───────────────┐                    ┌───────────────┐
│  FRONTEND     │                    │  DIRECTUS     │
│  Astro SSR    │◄──── REST API ────►│  Port 8055    │
│  Port 4321    │                    │  Headless CMS │
└───────────────┘                    └───────┬───────┘
                                             │
                                             ▼
                         ┌───────────────────────────────────┐
                         │  POSTGRESQL 16 + POSTGIS          │
                         │  30+ Collections                  │
                         │  Harris Matrix Schema Order       │
                         └───────────────────┬───────────────┘
                                             │
                                             ▼
                         ┌───────────────────────────────────┐
                         │  REDIS 7                          │
                         │  Session Cache + BullMQ Jobs      │
                         └───────────────────────────────────┘

6. Multi-Tenancy Model

Isolation Level Implementation
Data site_id FK on all content tables
Routes Domain-based routing via Traefik
Authentication Directus role-based access control
Permissions Site-scoped API tokens

Tenant capacity: Unlimited sites. Horizontal scaling via Docker replicas.


7. Feature Inventory

7.1 Intelligence Library (Data Management)

  • Avatar Intelligence Manager
  • Geo Intelligence Map (Leaflet integration)
  • Spintax Dictionary Manager
  • Cartesian Pattern Builder

7.2 Content Factory (Production)

  • Kanban workflow board
  • Campaign wizard (geo + spintax modes)
  • Jobs queue with progress monitoring
  • Scheduler with Gaussian distribution

7.3 Launchpad (Site Builder)

  • Multi-site management
  • Block-based page builder
  • Navigation editor
  • Theme customization

7.4 SEO Engine (Optimization)

  • Headline generation (spintax permutation)
  • Fragment assembly (6-pillar structure)
  • Internal link insertion
  • Duplicate content detection
  • Sitemap drip scheduling

7.5 Analytics (Tracking)

  • Pageview tracking
  • Event tracking
  • Conversion tracking
  • Dashboard with aggregations

8. Competitive Position

Capability Spark Generic CMS AI Writers
Multi-tenant Partial
Geo-targeting ✓ (3,143 counties)
Persona-targeting ✓ (10+ avatars) Limited
Spintax processing
Cartesian patterns
Self-hostable Varies
Queue-based scaling

9. Revenue Model Indicators

Model Mechanism
SaaS per site Monthly fee per managed site
Volume pricing Tiered by articles generated
Enterprise Self-hosted license + support
White-label Reseller partnerships

10. Development Status

Milestone Status Description
M1: Intelligence Library ✓ Complete Full CRUD + stats
M2: Content Factory ✓ Complete Kanban + queue
M3: All Collections ✓ Complete 30+ schemas
M4: Launchpad ✓ Complete Site builder
M5: Production ✓ Deployed Live on Coolify

Current Status: Operational. Active development on feature enhancements.


11. Key Files

File Purpose
complete_schema.sql Golden Schema (Harris Matrix ordered)
docker-compose.yaml Infrastructure definition
frontend/src/lib/schemas.ts TypeScript type definitions
frontend/src/pages/api/* 30+ API endpoints