Selected AI engineering work — production AI platform, customer-facing pursuit microsite pattern, token-efficient Claude Code corpus design, and supporting primitives.
Sanitized for public publication. Customer names, internal product names, employee names, and specific deal amounts removed. Patterns and methods preserved.
Public résumé: /michael-tuszynski/ · Blog: The Cloud Codex
What I build
Production AI systems for engineering and revenue teams — focusing on versioned, auditable platforms with guardrails, evaluations, and real adoption rather than demos. Current platform work codifies a Fortune-1000 professional services org's pursuit lifecycle as an internal AI platform: a Claude Code plugin marketplace orchestrating Salesforce, Microsoft 365, OneDrive, Anthropic Claude, and AWS Bedrock.
AI engineering platform (in production, 9+ months)
An AI-native operating system for enterprise presales pursuits covering discovery, qualification, statement-of-work lifecycle, risk review, and engagement handoff on a single context store.
Scale at last cut:
- 11 plugins, 60+ slash commands, 562 passing tests
- ~51K LOC TypeScript
- Multi-provider model routing (Anthropic Claude primary, AWS Bedrock fallback) with env-var toggle
- Prompt-contract framework with versioned input/output schemas
- Evidence-citation guardrails: High/Medium/Low confidence + dated source URLs required on every claim
- Session-start hooks hot-loading operational knowledge from synced channels
- Prompt caching for shared guardrails and knowledge base
- Per-command rule contracts; provider switching via environment variable
Outcomes (anonymized):
- Compressed manual artifact production hours into command-driven workflows
- Enabled $0.5M+ and $1M+ enterprise engagements to close on platform-authored statements of work with iterative revisions
- Partner-funded engagement program added co-funding to a pursuit
- Productized AI Well-Architected workshop at small-five-figure list price; first paying customer in flight
The pattern: start shipping custom for one customer, then extract patterns into a framework for ten field engineers.
Customer-facing pursuit microsite pattern
Built a customer-facing pursuit microsite in days for an enterprise AI agent engagement in K-12 EdTech using Claude-on-platform. The microsite anchored follow-up conversations with the client's CEO and hyperscaler co-sell partner; the pursuit moved from ideation workshop to verbal Phase-1 commitment in three weeks.
Pattern observations:
- AI-listener-agent architecture on AWS Bedrock + ECS Fargate behind WebSocket API Gateway, with OpenSearch Serverless for RAG and Transcribe/Polly for voice
- POC built live in ideation workshop (single-file Flask + Anthropic, ~3,700 lines, eight views) served as conversation piece pulling stakeholders into architecture discussions faster than slides
- Microsite + POC + workshop collapsed what used to be three serial deliverables into parallel motion
The artifact pattern is portable: AI pursuit microsites are cheap to build, land differently than decks, and serve as persistent reference points through Phase-1/Phase-2 sequencing.
Token-efficient Claude Code corpus design
A reproducible methodology for keeping a multi-plugin Claude Code corpus lean. Took a production corpus from 221K tokens → 179K tokens (19% reduction) without removing procedural steps or contract terms.
The numbers:
| Pool | Before | After | Saved | % |
|---|---|---|---|---|
| Commands (per invocation) | 146,105 | 111,140 | −34,965 | −23.9% |
| Rules (auto-load every session) | 18,375 | 14,478 | −3,897 | −21.2% |
| Skills (per invocation) | 56,620 | 53,815 | −2,805 | −5.0% |
| Total | 221,100 | 179,433 | −41,667 | −18.8% |
Token counts via tiktoken / cl100k_base — close proxy for Anthropic tokenization.
Why it matters:
- Context-window headroom on every command
- Faster first-token latency
- Cleaner prompts drive better instruction adherence
- Linear scaling — doubled users means doubled savings
Seven patterns that drove the savings:
- Rules dedup is the cheapest 4K tokens you'll find. Multiple plugins carried byte-identical context files. Collapsing into a shared
core/rules/directory saved ~3.9K tokens loading in every session. - The top decile of commands held 42% of the command corpus. Token distribution is heavily right-skewed. Sort by tokens descending and trim the top decile.
- Pipeline-banner repetition is invisible but expensive. Step counters like
> **Step 3 of 5**repeated at the top AND inside every subsection. Once at the top is enough. Saved ~600–1,200 tokens per affected command. - Auto-resolve fallback boilerplate copy-pastes faster than it earns. Six-bullet "if exactly one... if zero, stop and ask... if multiple, stop and ask..." blocks collapsed to one sentence. Saved ~400 tokens per command.
- Multi-format output templates with 90% overlap should be one base + deltas. If "JSON output," "markdown output," and "console output" list the same 12 fields, write one structured template and note format-specific variants as deltas.
- Persona reload reminders inside commands are redundant. Personas load once at session start. Reminders inside individual commands are dead weight.
- Skills are different — be careful what you trim. Framework specs with load-bearing prose and code (helper-class definitions, PPTX coordinates with EMU values) shouldn't be touched. Skills accounted for only 5% of savings.
Best practices for new commands:
- Default to terse. Most commands need 1.5K–2.5K tokens, not 5K+.
- Don't restate guardrails. They live in
core/rules/. Reference them. - Don't restate path resolution. The contract is in
core/rules/. - Auto-resolve fallback is one sentence.
- Multi-format outputs go in one template with conditional sections.
- Banner repetitions: once at the top, never repeat.
- Procedural specifics, code, and contract terms stay verbatim.
Methodology (reproducible):
- Baseline the corpus. A
tools/token-baseline.pywalksplugins/*/{commands,rules,skills}/*.md, counts tokens withtiktoken, emits CSV + XLSX with per-file detail, totals by type and plugin, top-20 sheet. - Identify the top offenders. Sort by tokens descending. Top decile concentrates the savings.
- Audit rules for byte-identical duplication. An
md5across plugin rule files surfaces collapse candidates. - Trim each targeted file. Cut redundant pre-flight prose, repeated path-resolution boilerplate, restated guardrails, banner repetitions, verbose multi-format templates. Keep procedural specifics, code, contract terms.
- Re-baseline after each batch. Label snapshots (
baseline,after-dedup,after-top7,final). Every snapshot is auditable.
"Lean prompts compound. Cluttered prompts compound differently."
Run quarterly, or any time the corpus grows by >10% without functional additions.
Selected AI engineering primitives
- Cross-platform OneDrive path resolver — disambiguates personal vs. organizational mount paths across macOS and Windows; survives the OneDrive rename/move dance that breaks most tooling.
- Prompt-contract framework — versioned input/output schemas per command, with contract violations surfacing in the developer console rather than degrading silently.
- Evidence-citation + confidence-disclosure guardrails — every claim in generated output must cite a dated source URL and a High/Medium/Low confidence rating; outputs refuse to generate if they can't satisfy the contract.
- Session-start hook that hot-loads operational knowledge — institutional memory becomes ambient without requiring a plugin release; the ops team can update guidance in a synced channel and the next session picks it up.
- Multi-provider AI infrastructure — Anthropic Claude (primary) with AWS Bedrock fallback; prompt caching for shared guardrails and knowledge base; per-command rule contracts; provider switching via env-var toggle.
:councilmulti-agent advisory pattern — pressure-tests recommendations through a 5-advisor council before reaching the user; surfaces dissent that single-agent flows hide.- Document generation pipeline — branded Microsoft Word output via AI-augmented markdown rendering; preserves enterprise template fidelity while generating from structured content.
Background
25 years in engineering leadership and platform work. Former CTO of a streaming platform (20+ person team, $4MM+ budget, 99.98% uptime, 100% YoY user growth, monolith-to-microservices migration). Six years as Senior Solutions Architect at AWS — co-authored theECS Workshop (canonical hands-on container workshop), featured speaker on theAWS re:Think Podcaston cost optimization.
UC Berkeley Executive Education — Professional Certificate in Machine Learning and Artificial Intelligence (2025). Harvard Data Science Review — Agentic AI Intensive (December 2025). AWS Certified Solutions Architect Professional + DevOps Engineer Professional.
Personal AI lab — NEXUS
Multi-machine personal R&D workspace prototyping patterns for client work. NAS + Mac-mini + laptop topology with ~7 production services (TypeScript/Node), 18+ LaunchAgents, a custom Slack bot with auto-discovered slash commands, semantic search over conversation history (Ollama embeddings), self-hosted dashboards behind Cloudflare Tunnels, and integrations with Anthropic Claude, Plaid, Microsoft 365, Synology DSM, GitHub, and Ghost CMS. Test bed for prompt-contract patterns, agent orchestration, hot-loaded knowledge, and evaluation harnesses deployed to enterprise engagements.
Public writing
- The Cloud Codex — daily-ish writing on AI engineering, platform work, agent orchestration, prompt caching, evaluation harnesses.
- What I Learned Building an Agentic AI Operating System for My Own Job — practitioner voice on shipping internal AI tooling.
- Agentic Application Modernization Reality — patterns applied to real modernization engagements.
- ECS Workshop — co-authored hands-on container training (AWS open source).
Inquiries: mike@mpt.solutions ·415-793-4717 ·Full résumé (PDF)
